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In our time, the ability to use and manipulate 
Information has become the single most 
important element of success, no matter how 
you measure it: financial security, power, or 
Status. Those who work by manipulating 
ideas and abstractions are the leaders and ben- 
eficiaries of our society. In such an era, high 
intelligence is an increasingly precious raw 
material for success. But despite decades of 
fashionable denial, the overriding and insis¬ 
tent truth about intellectual ability is that it is 
endowed unequally, for reasons that govern- 
ment policies can do little to change. 

The major purpose of The Bell Curve is to 
reveal the dramatic transformation that is 
currently in process in American society—a 
process that has created a new kind of dass 
structure led by a “cognitive elite,” itself a 
result of concentration and self-selection in 
those social pools well endowed with cognitive 
abilities. This transformation, sadly, has its 
opposite: the perpetuation of a dass of people 
deficient in these endowments and abilities, 
and increasingly doomed to labor, if they find 
work at all, outside the information economy. 

In a book that is certain to ignite an explosive 
controversy, Herrnstein and Murray break new 
ground in exploring the ways that low intelli¬ 
gence, independent of social, economic, or ethnic 
Sackground lies at the root of many of our social 
problems. The authors also demonstrate the 
truth of another taboo fact: that intelligence lev- 
els differ among ethnic groups. This finding is 
already well-known and widely discussed among 
psychometricians and other scholars. In The Bell 
Curve , Hermstein and Murray open this body of 
scholarship to the general public. 

Our public policy refuses to acknowledge the 
proofs of human difference, or to deal with its 
consequences. With relentless and unassailable 
thoroughness, Herrnstein and Murray for the 
first time show that for a wide ränge of 
intractable social problems, the decisive correla- 
tion is between a high incidence of the problem 
and the low intelligence of those who suffer 
from it: this holds for school dropouts, unem- 


ployment, work-related injury, out of wedlock 
births, crime, and many other social problems. 
Though we stubbornly deny it, these social 
problems correlate to a significant degree with 
intelligence. 

Only by facing up to this undeniable news 
can we begin to accurately assess the nations 
problems and make realistic plans for address- 
ing them. That means in the first instance 
accepting that there are great differences in 
intelligence between groups of people, as well 
as among individuals in any group. Just as 
important, it also means learning that these 
group differences do not justify prejudicial 
assumptions about any member of a given 
group whose intelligence and potential may, in 
fiict, be anywhere under the bell curve of intel¬ 
ligence from the dullest to the most brilliant. 
But it does mean we must have the courage to 
revise what we can talk about in public. This 
book is the first important Step toward that 
difficult but necessary goal. 
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The re is a most absurd and audacious Method of reasoning avowed by 
so me Bigots and Enthusiasts, and through Fear assented to by some wiser 
and better Men; it is this. They argue against a fair Discussion of pop¬ 
ulär Prejudices, because, say they, tho’ they would be found without any 
reasonable Support, yet the Discovery might be productive of the most 
dangerous Consequences. Absurd and blavsphemous Notion! AvS if all 
Happiness was not connected with the Practice of Virtue, which nec- 
essarily depends upon the Knowledge ofTruth. 


Edmund Burke 
A Vindication of Natural Society 
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A Note to the Reader 


We have designed The Bell Cuwe to be read at several levels. 

At the simplest level, it is only about thirty pages long. Each chap- 
ter except the Introduction and the final two chapters opens with a 
precis of the main findings and conclusions minus any evidence for 
them, written in an informal style free of technical terms. You can get 
a good idea of what we have to say by reading just those introductory 
essays. 

The next level is the main text. It is accessible to anyone who en- 
joys reading, for example, the Science section of the news magazines. 
No special knowledge is assumed; everything you need to know to fob 
low all of the discussion is contained within the boolc. The main text 
does include considerable technical material, however. The documen- 
tation becomes especially extensive when we come to a topic so con- 
troversial that many readers will have a “This can’t possibly be true” 
reaction. 

Sprinkled throughout the book are boxes that add more detail, 
discuss alternative ways of thinking about the data, or relate tidbits 
that don’t quite fit in the text. You may skip any of these without 
interrupting the flow of the narrative, but we think they add some- 
thing (or they wouldn’t be there), and we encourage you to dip into 
them. 

The endnotes provide the usual scholarly references. Some of them, 
indicated in text by endnote numbers enclosed in brackets, add short 
discussions that will be of interest mostly to specialists. 

Finally, the appendixes elaborate on key issues. For example, readers 
who come to the book unfamiliar with statistics will find that Appen¬ 
dix 1 supplies the basics; if you want to know more about the debate 
over cultural bias in intelligence tests, Appendix 5 guides you through 
the literature on that issue; and so on. Other appendixes lay out the Sta¬ 
tistical detail that could not be fit into the main text and was too bulky 
for a note. 


xx Note to Reader 


Regarding those pesky impersonal third-person singulär pro- 
nouns and other occasions when the authors must assign a gender to a 
fictitious person used to illustrate a point, it seems to us there is a sim¬ 
ple, fair solution, which we hereby endorse: Unless there are obvious 
reasons not to, use the gender of the first author. We use he throughout. 


Preface 


This Book is about differences in intellectual capacity among people and 
groups and what those differences meanfor Americasfuture. The rela- 
tionships we will be discussing are among the most sensitive in Con¬ 
temporary America—so sensitive that hardly anyone writes or talks 
about them in public. It is not for lack of Information, as you will see. 
On the contrary, knowledge about the Connections between inteil i- 
gence and American life has been accumulating for years, available in 
the joumals held by any good university lihrary and on the Computer 
tapes and disks of public use databases. 

People have shied from the topic for many reasons. Some think that 
the concept of intelligence has been proved a fraud. Others recall to¬ 
talitären eugenic schemes based on IQ scores or worry about such 
schemes arising once the subject breaks into the open. Many fear that 
discussing intelligence will promote racism. 

The friends and colleagues whose concerns we take most seriously say 
something like this: “Yes, we acknowledge that intelligence is impor¬ 
tant and that people differ. But the United States is founded on the prin- 
ciple that people should be equal under the law. So what possible 
relevance can individual differences in intelligence have to public pol- 
icy? What good can come of writing this book?” In ans wer, we ask these 
friends and you, the reader, to share for a moment our view of the Situ¬ 
ation, perhaps suppressing some doubts and assuming as true things that 
we will subsequentiy try to prove are true. Here is our story: 

A great nation, founded on principles of individual liberty and Self¬ 
government that constitute the crowning achievement of statecraft, 
approaches the end of the twentieth Century. Equality of rights— 
another central principle—has been implanted more deeply and more 
successfully than in any other society in history. Yet even as the princi¬ 
ple of equal rights triumphs, stränge things begin to happen to tvvo small 
segments of the population. 

In one segment, life gets better in many ways. The people in this group 
are welcomed at the best Colleges, then at the best graduate and profes- 





xxii Preface 


Preface xxiii 


sionaL schools, regardless of their parents 1 wealth. After they complete 
their education, they enter fulfiliing and prestigious careers. Their in- 
comes continue to rise even when income growth stagnates for every- 
one eise. By their maturity, these fortunate ones commonly have 
six-figure incomes. Technology works in their behalf, expanding their 
options and their freedom, putting unprecedented resources *at their 
command, enhancing their ability to do what they enjoy doing. And as 
these good things happen to them, they gravitate to one another, in- 
creasingly enabled by their affluence and by technology to werk to- 
gether and live in one another’s Company—and in isolation from 
everybody eise. 

In the other group, life gets worse, and its members collect at the hot- 
tom of society. Poverty is severe, drugs and crime are rampant, and the 
traditional family all but disappears. Economic growth passes them by. 
Technology is not a partner in their lives but an electronic opiate. They 
live together in urban centers or scattered in rural backwaters, but their 
presence hovers over the other parts of town and countryside as well, 
creating fear and resentment in the rest of society that Ls seldom openly 
expressed but festers nonetheless. 

Pressures from these contrasting movements at the opposite ends of 
society put terrific stress on the entire structure. The mass of the nation 
belongs to neither group, but their lives are increasingly shaped by the 
power of the fortunate few and the plight of the despairing few. The cul- 
ture’s sense of what is right and wrong, virtuous and mean, attainable 
and unattainable—most important, its sense of how people are to live 
together—is altered in myriad ways. The fragile web of civility, mutual 
regard, and mutual obligations at the heart of any happy society begins 
to tear. 

In trying to think through what is happening and why and in trying 
to understand thereby what ought to be done, the nation’s social seien- 
tists and journalists and politicians seek explanations. They examine 
changes in the economy, changes in demographics, changes in the cul- 
ture. They propose Solutions founded on better education, on more and 
better jobs, on specific social interventions. But they ignore an under- 
lying element that has shaped the changes: human intelligence—the 
way it varies within the American population and its cruciaily chang- 
ing role in our destinies during the last half of the twentieth Century. To 
try to come to grips with the nation’s problems without understanding 
the role of intelligence is to see through a glass darkly indeed, to grope 


with Symptoms instead of causes, to stumble into supposed remedies that 
have no chance of working. 

We are not indifferent to the ways in which this book, wrongly con- 
strued, might do harm. We have worried about them from the day we 
ser to work. But there can be no real progress in solving Americas so¬ 
cial problems when they are as misperceived as they are today. What 
good can come of understanding the relationship of intelligence to so¬ 
cial structure and public policy? Little good can come without it. 




Acknowledgments 


The first thing that made this book possible was our good fortune in be- 
ing where we are—Harvard University and the American Enterprise In¬ 
stitute. Different as they are in many ways, these two splendid 
institutions gave us a gift that is too often taken for granted: freedom 
and support to pursue our inquiries wherever they took us. 

For learned and often Creative advice for dealing with methodologi- 
cal issues, our thanks go to James Heckman, Derek Neal, Robert Rosen- 
thal, Donald Rubin, Christopher Winship > and the Harvard Psychology 
Department Seminar on the Analysis of Psychological Data. The staff 
of the Center for Human Resource Research—Paula Baker must be sin- 
gled out—were unfailingly ready to answer our questions about the in- 
tricacies of the National Longitudinal Survey of Youth. In the world of 
psychometrics and testing, we benefited especially from the advice of 
Robert Forsyth, Arthur Jensen, Richard Lynn, Len Ramist, Malcolm 
Ree, and Frank Schmidt. Others who have read chapters or listened to 
us try to explain what we’re up to and then have offered advice, or at 
least raised their eyebrows usefully, are Douglas Besharov, Eileen 
Blumenthal, John Bunzel, John Dilulio, Christopher DeMuth, David 
Ellwood, Richard Epstein, Ronald Haskins, James Herrnstein, Karl 
Hess, Michael Horowitz, Fred Jewett, Michael Joyce, Cheryl Keller, 
Robert Klitgaard, Irving Kristol, Richard Larry, Marlyn Lewis, Glenn 
Loury, Harvey Mansfield, Frank Miele, Michael Novak, Warren Reed, 
Daniel Seligman, Irwin Stelzer, Joan Kennedy Taylor, Abigail Thern- 
strom, Daniel Vining, Dean Whitla, James Wilson, and Douwe Yntema. 
Along with some of those already mentioned, John Bishop, Philip Cook, 
George Borjas, Robert Frank, Sandra Scarr, and Richard Weinberg pro- 
vided us with drafts of their ongoing work that filled in missing pieces 
of the puzzle. 

We took unblushing advantage of the energy and good nature of re- 
search assistants, with Kris Kirby and Ira Camahan taking on some huge 
tasks early in the work, followed by the invaluable help of Kristin 



xx vi Acknowledgments 


Caplice, Joseph Fazioli, and one other who asked to remain anonymous 
out of his wish to preserve, as he put it, a viable political future. 

All of these people, and others whom we have surely forgotten over 
the years, deserve our thanks for their help, but not the risk of being 
held accountable for what we’ve done with it. 

We were lucky to have Erwin Glikes as our editor and Amanda Ur¬ 
ban as our agent—the best in the business, and utterly undaunted by 
thehazards of this enterprise. The BellCurve is Erwins title, and its man- 
uscript was the last he completed editing before his death. 

In a list of acknowledgments, we would be remiss not to mention each 
other. We decided to write this book in November of 1989, and since 
the spring of 1990 have been working at it nearly full time, for long 
Stretches much more than full time, in continual collaboration. Our 
partnership has led to a more compe lling intellectual ad venture and a 
deeper friendship than we could have imagined. Authorship is alpha- 
betical; the work was symbiotic. 

At last, hovv shall we express our thanks to Susan Herrnstein and 
Catherine Cox? They could not be, and were not, overjoyed to see their 
husbands embark on a project as controversial as this one. But they have 
endured our preoccupation with the book and its intrusion into our fam- 
ilies* lives with love and humor. Beyond that, each of them read the en- 
tire manuscript, persuasively but affectionately pressing on us the 
benefit of their fine judgment, their good sense, and their honesty. In 
more ways than one, this book could not have been written without 
their help. 

Richard J. Herrnstein 
Charles Murray 
3 June 1994 


Introduction 


That the word intelligence describes something real and that it varies 
from person to person is as universal and ancient as any understanding 
about the state of being human. Literate cuLtures everywhere and 
throughout history have had words for saying that some people are 
smarter than others. Given the survival value of intelligence, the con- 
cept must be still older than that. Gossip about who in the tribe is clever- 
est has probably been a topic of conversation around the fixe since fires, 
and conversation, were invented. 

Yet for the last thirty years, the concept of intelligence has been a 
pariah in the world of ideas. The attempt to measure it with tests has 
been variously dismissed as an artifact of racism, political reaction, Sta¬ 
tistical bungling, and scholarly fraud. Many of you have reached this 
page assuming that these accusations are proved. In such a context 
comes this book, blirhely proceeding on the assumption that intelli¬ 
gence is a reasonably well-understood construct, measured with accu- 
racy and fairness by any number of standardized mental tests. The rest 
of this book can he better followed if you first understand why we can 
hold such apparently heterodox views, and for this it is necessary to 
know something about the story of measured intelligence. 

INTELLIGENCE ASCENDANT 

Variation in intelligence became the subject of productive scientific 
study in the last half of the nineteenth Century, stimulated, like so many 
other intellectual developments of that era, by Charles Darwin’s theory 
of evolution. Darwin had asserted that the transmission of inherited in¬ 
telligence was a key Step in human evolution, driving our simian an- 
cestors apart from the other apes. Sir Francis Galton, Darwin's young 
cousin and already a celebrated geographer in his own right, seized on 
this idea and set out to demonstrate its continuing relevance hy using 
the great families of Britain as a priniary source ot data. He presented 
evidence that intellectual capacity of various sorts ran in families in 



2 Iniroduction 


Introduction 3 


Hereditär y Genius, published just a decade after the appearance of Ori¬ 
gin of S pecies in 1859. So began a long and deeply controversial associ- 
ation between intelligence and heredity that remains with us today. 

Galton realized that he needed a precise, quantitative measure of the 
mental qualities he was trying to analyze, and thus he was led to put in 
formal terms what most people had always taken for granted: People 
vary in their intellectual abilities and the differences matter, to them 
personally and to society. 2 Not only are some people smarter than oth- 
ers, said Galton, but each person’s pattem of intellectual abilities is 
unique. People differ in their talents, their intellectual strengths and 
weaknesses, their preferred fonns of imagery, their mental vigor. 

Working from these observations, Galton tried to devise an intelli¬ 
gence test as we understand the term today: a set of items probing in¬ 
tellectual capacities that could be graded objectively. Galton had the 
idea that intelligence would surface in the form of sensitivity of per- 
ceptions, so he constructed tests that relied on measures of acuity of 
sight and hearing, sensitivity to slight pressures on the skin, and speed 
of reaction to simple Stimuli. His tests failed, but others followed where 
Galton had led. His most influential immediate successor, a French psy¬ 
chologist, Alfred Binet, soon developed questions that attempted to 
measure intelligence by measuring a person’s ability to reason, draw 
analogies, and identify patterns. 3 These tests, crude as they were by mod¬ 
ern Standards, met the key criterion that Galton’s tests could not: Their 
results generally accorded with common understandings of high and low 
intelligence. 

By the end of the nineteenth Century, mental tests in a form that 
we would recognize today were already in use throughout the British 
Commonwealth, the United States, much of Continental Europe, and 
Japan. 141 Then, in 1904, a former British Army officer named Charles 
Spearman made a conceptual and Statistical breakthrough that has 
shaped both the development and much of the methodological contro- 
versy about mental tests ever since. 5 

By that time, considerable progress had been made in statistics. Un- 
like Galton in his early years, investigators in the early twentieth Cen¬ 
tury had available to them an invaluable number, the correlation 
coefficient first devised by Gal ton himself in 1888 and elaborated by his 
disciple, Karl Pearson. 6 Before the correlation coefficient was available, 
scientists could observe that two variables, such as height and weight, 
seemed to vary together (the taller the heavier, by and large), but they 


had no way of saying exactly how much they were related. With Pear- 
son’s r, as the coefficient was labeled, they now could specify “how much” 
of a relationship existed, on a scale ranging from a minimum of-1 (for 
perfectly inverse relationships) to +1 (for perfectly direct relationships). 

Spearman noted that as the data from many different mental tests 
were accumulating, a curious result kept turning up: If the same group 
of people took two different mental tests, anyone who did well (or 
poorly) on one test tended to do similarly well (or poorly) on the other. 
In Statistical terms, the scores on the two tests were positively corre- 
lated. This outcome did not seem to depend on the specific content of 
the tests. As long as the tests involved cognitive skills of one sort or an- 
other, the positive correlations appeared. Furthermore, individual items 
within tests showed positive correlations as well. If there was any cor¬ 
relation at all between a pair of items, a person who got one of them 
right tended to get the other one right, and vice versa for those who got 
it wrong. In fact, the pattem was stronger than that. It tumed out to be 
nearly impossible to devise items that plausibly measured some cogni¬ 
tive skill and were not positively correlated with other items that plau- 
sibly measured some cognitive skill, however disparate the pair of skills 
might appear to be. 

Tire size of the positive correlations among the pairs of items in a test 
did vary a lot, however, and it was this combination—positive correla¬ 
tions throughout the correlation matrix, but of varying magnitudes— 
that inspired Spearman’s insight. 171 Why are almost all the correlations 
positive? Spearman asked. Because, he answered, they are tapping into 
the same general trait. Why are the magnitudes different? Because some 
items are more closely related to this general trait than others. 181 

Spearman’s Statistical method, an early example of what has since be- 
come known as factor analysis, is complex, and we will explore some of 
those complexities. But, for now, the basis for factor analysis can be read- 
ily understood. Insofar as two items tap into the same trait, they share 
something in common. Spearman developed a method for estimating 
how much sharing was going on in a given set of data. From almost any 
such collection of mental or academic test scores, Spearman’s method 
of analysis uncovered evidence for a unitary mental factor, which he 
named g, for “general intelligence.” The evidence for a general factor in 
intelligence was pervasive but circumstantial, based on Statistical analy¬ 
sis rather than direct observation. Its reality therefore was, and remains, 
arguable. 
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Spearman then made another major contribution to the study of in- 
telligence by defining what this mysterious g represented. He hypothe- 
sized that g is a general capacity for inferring and applying relationships 
drawn from experience. Being able to grasp, for example, the relation- 
ship between a pair of words like harvest and yield , or to recite a list of 
digits in reverse Order, or to see what a geometrical pattem would look 
like upside down, are examples of tasks (and of test items) that draw an 
g as Spearman conceived of it. This definition of intelligence differed 
subtly from the more prevalent idea that intelligence is the ability to 
leam and to generalize what is learned. The course of learning is affected 
by intelligence, in Spearman’s view, but it was not the thing in itself. 
Spearmanian intelligence was a measure of a person’s capacity for com- 
plex mental work. 

Meanwhile, other testers in Europe and America continued to refine 
mental measurement. By 1908, the concept of mental level (later called 
mental age) had been developed, followed in a few years by a slightly 
more sophisticated concept, the intelligence quotient. IQ at first was 
just a way of expressing a person’s (usually a child’s) mental level rela¬ 
tive to his or her contemporaries. Later, as the uses of testing spread, IQ 
became a more general way to express a person’s intellectual perfor- 
mance relative to a given population, Already by 1917, soon after the 
concept of IQ was first defined, the U.S. Army was administering in¬ 
telligence tests to classify and assign recruits for World War 1. Within a 
few years, the letters “IQ” had entered the American vernacular, where 
they remain today as a universally understood synonym for intelligence. 

To this point, the study of cognitive abilities was a success story, rep- 
resenting one of the rare instances in which the new soft Sciences were 
able to do their work with a rigor not too far short of the Standards of 
the traditional Sciences, A new specialty within psychology was created, 
psychometrics. Although the debates among the psychometricians were 
often fierce and protracted, they produced an expanded understanding 
of what was involved in mental capacity, The concept ofgsurvived, em- 
bedded in an increasingly complex theory of the structure of cognitive 
abilities. 

Because intelligence tests purported to test rigorously an important 
and valued trait about people (including ourselves and our loved ones), 
IQ also became one of the most visible and controversial products of so¬ 
cial Science. The first wave of public controversy occurred during the 
first decades of the Century, when a few testing enthusiasts proposed us- 


ing the results of mental tests to support outrageous racial policies. Ster- 
ilization laws were passed in sixteen American States between 1907 and 
1917, with the elimination of mental retardation being one ofthe prime 
targets of the public policy. “Three generations of imbeciles are enough,” 
Justice Oliver Wendell Holmes declared in an opinion upholding the 
constitutionality of such a law. 9 It was a Statement made possible, per- 
haps encouraged, by the new enthusiasm for mental testing. 

In the early 1920s, the chairman of the House Committee on Immi¬ 
gration and Naturalization appointed an “Expert Eugenical Agent” for 
his committee’s work, a hiologist who was especiaily concemed about 
keeping up the American level of intelligence by suitable immigration 
policies. 10 An assistant professor of psychology at Princeton, Carl C. 
Brigham, wrote a book entitled A Study of American Intelligence using 
the results of the U.S. Army’s World War I mental testing program to 
conclude that an influx of immigrants from Southern and eastern Eu¬ 
rope would lower native American intelligence, and that immigration 
therefore should he restricted to Nordic stock (see the box about tests 
and immigration). 11 


Fact and Fiction About Immigration and Intelligence Testing 


Two stories about early IQ testing have entered the folklore so thoroughly 
that people who know almost nothing eise about that history bring them 
up at the beginning of almost any discussion of IQ. The first story is that 
Jews and other Immigrant groups were thought to be below average in in¬ 
telligence, even feebleminded, which goes to show how untrustworthy 
such tests (and the testers) are. The other story is that IQ tests were used 
as the basis for the racist immigration policies of the 1920s, which shows 
how dangerous such tests (and the testers) are. 12 

The first is hased on the work done at Ellis Island by H. H. Goddard, 
who explicitly preselected his sample for evidence of low intelligence (his 
purpose was to test his test’s usefulness in screening for feeblemindedness), 
and did not try to draw any conclusions about the general distribution of 
intelligence in immigrant groups. 11 The second has a stronger circumstan- 
tial case: Brigham published his book just a year before Congress passed 
the Immigration Restriction Act of 1924, which did indeed tip the flow of 
immigrants toward the Western and northem Europeans, The difficulty 
with making the causal case is that a close reading of the Hearings for the 
hi ll shows no evidence that Brigham’s book in particular or IQ tests in gen¬ 
eral played any role. 14 
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Critics responded vocally. Young Walter Lippmann, already an in- 
fluential columnist, was one of the most prominent, fearing power-hun- 
gry intelligence testers who yearned to “occupy a position of power 
which no intellectual has held since the coliapse of theocracy.” 15 In a 
lengthy exchange in the New Republic in 1922 and 1923 with Lewis Ter- 
man, premier American tester of the time and the developer of the Stan- 
ford-Binet IQ test, Lippmann wrote, “I hate the impudence of a claim 
that in fifty minutes you can judge and classify a human being’s predes- 
tined fitness in life. I hate the pretentiousness of that claim. I hate the 
abuse of scientific method which it involves. I hate the sense of superi- 
ority which it creates, and the sense of inferiority which it imposes.” 16 

Lippmann’s characterization of the tests and the testers was some- 
times unfair and often factuaily wrong, as Terman energetically pointed 
out. 17 But while Terman may have won the technical arguments, Lipp¬ 
mann was right to worry that many people were eager to find Connec¬ 
tions between the results of testing and the more chilling implications 
of social Darwinism. Even if the psychometricians generally made müd¬ 
est ctaims for how much the tests predicted, it remained true that “IQ” 
—that single number with the memorable labe!—was seductive. As 
Lippmann feared, people did tend to give more credence to an individ- 
ual’s specific IQ score and make broader generalizations from it than was 
appropriate. And not least, there was plenty to criticize in the psycho- 
metricians’ results. The methods for collecting and analyzing quantita¬ 
tive psychological data were still new, and some basic inferential 
mistakes were made. 

If the tests had been fatally flawed or merely uninformative, they 
would have vanished, Why this did not happen is one of the stories we 
will be telling, but we may anticipate by observing that the use of tests 
endured and grew because society’s largest institutions—schools, mili- 
tary forces, industries, governments—depend significantly on measur- 
able individual differences. Much as some observers wished it were not 
true, there is often a need to assess differences between people as ob- 
jectively, fairly, and efficiently as possible, and even the early mental 
tests often did a better job of it than any of the alternatives. 

Düring the 1930s, mental tests evolved and improved as thetr use 
continued to spread throughout the world. David Wechsler worked on 
the initial version of the tests that would eventually become the Wechs¬ 
ler Adult Intelligence Scale and the Wechsler Intelligence Scale for 
Children, the famous WAIS and WISC. Terman and his associates pub- 


lished an improved version of the Stanford-Binet. But these tests were 
individually administered and had to be scored by trained personnel, 
and they were therefore too expensive to administer to large groups of 
people. Psychometricians and test publishers raced to develop group- 
administered tests that could be graded by machine. In the search for 
practical, economical measurements of intelligence, testing grew from 
a cottage industry to big business, 

World War II stimulated another major advance in the state of the 
art, as psychologists developed paper-and-pencil tests that could accu- 
rately identify specific military aptitudes, even ones that included a sig- 
nificant element of physical aptitude (such as an aptitude for flying 
airplanes). Shortly after the war, psychologists at the University of Min¬ 
nesota developed the Minnesota Multiphasic Personality Inventory, the 
first machine-gradable standardized test with demonstrated validity as 
a predictor of various personality disorders. Later came the California 
Psychological Inventory, which measured personality characteristics 
within the normal ränge—“social presence” and “self-control,” for ex- 
ample. The testing industry was flourishing, and the annual Mental Mea¬ 
surements Yearbook that cataloged the tests grew to hundreds of pages. 
Hundreds of millions of people throughout the world were being psy- 
chologically tested every year. 

Attacks on testing faded into the background during this period. 
Though some psychometricians must have known that the tests were 
capturing human differences that had unsettling political and social im¬ 
plications, no one of any stature was trying to use the results to promote 
discriminatory, let alone eugenic, laws. And though many intellectuals 
outside the testing profession knew of these results, the political agen- 
das of the 1940s and 1950s, whether of New Deal Democrats or Ei- 
senhower Republicans, were more pragmatic than ideological. Yes, 
intelligence varied, but this was a fact of life that seemed to have little 
bearing on the way public policy was conducted. 

INTELLIGENCE BESIEGED 

Then came the 1960s, and a new controversy about intelligence tests 
that continues to this day. It arose not from new findings but from a new 
outlook on public policy. Beginning with the rise of powerful social dem- 
ocratic and socialist movements after World War I and accelerating 
across the decades until the 1960s, a fundamental shift was taking place 
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in the received wisdom regarding equality. This was most evident in the 
political arena, where the civil rights movement and then the War on 
Poverty raised Americans’ consciousness about the nature of the in- 
equalities in American society. But the changes in outlook ran deeper 
and broader than politics. Assumptions about the very origins of social 
Problems changed profoundly. Nowhere was the shift more pervasive 
than in the field of psychology. 

Psychometricians of the 1930s had debated whether intelligence is 
almost entirely produced by genes or whether the environment also 
plays a role. By the 1960s and 1970s the point of contention had shifted 
dramatically. It had somehow become controversial to claim, especially 
in public, that genes had any effect at all on intelligence. Ironically, the 
evidence for genetic factors in intelligence had greatly strengthened 
during the very period when the terms of the debate were moving in the 
other direction. 

ln the psychological laboratory, there was a similar shift. Psycholog- 
ical experimenters early in the Century were, if anything, more likely to 
concentrate on the inborn pattems of human and animal behavior than 
on how the leaming process could change behavior. 18 But from the 
1930s to the 1960s, the leading behaviorists, as they were called, and 
their students and disciples were almost all specialists in leaming the- 
ory. They filled the technical journals with the results of leaming ex- 
periments on rats and pigeons, the tacit implication being that genetic 
endowment mattered so little that we could ignore the differences 
among species, let alone among human individuals, and still discover 
enough about the learning process to make it useful and relevant to 
human concerns. 19 There are, indeed, aspects of the learning process 
that cross the lines between species, but there are also enormous differ¬ 
ences, and these differences were sometimes ignored or minimized when 
psychologists explained their findings to the lay public. B. F. Skinner, at 
Harvard University, more than any other of the leading behaviorists, 
broke out of the academic world into public attention with books that 
applied the findings of laboratory research on animals to human soci¬ 
ety at large. 20 

To those who held the behaviorist view, human potential was almost 
perfectly malleable, shaped by the environment. The causes of human 
deficiencies in intelligence—or parenting, or social behavior, or work 
behavior—lay outside the individual. They were caused by flaws in so- 
ciety. Sometimes capitalism was blamed, sometimes an uncaring or in- 


competent govemment. Further, the causes of these deficiencies could be 
fixed by the right public policies—redistribution of wealth, better educa- 
tion, better housing and medical care. Once these environmental causes 
were removed, the deficiencies should vanish as well, it was argued. 

The contrary notion—that individual differences could not easily be 
diminished by government Intervention—collided head-on with the 
enthusiasm for egalitarianism, which itself collided head-on with a half- 
century of IQ data indicating that differences in intelligence are in- 
tractable and significantly heritable and that the average IQ of various 
socioeconomic and ethnic groups differs. 

ln 1969, Arthur Jensen, an educational psychologist and expert on 
testing from the University of California at Berkeley, put a match to this 
volatile mix of Science and ideology with an article in the Harvard E d- 
ucational Review. 21 Asked by the Review’s editors to consider why com- 
pensatory and remedial education programs begun with such high hopes 
during the War on Poverty had yielded such disappointing results, 
Jensen concluded that the programs were bound to have little success 
because they were aimed at populations of youngsters with relatively 
low IQs, and success in school depended to a considerable degree on IQ. 
IQ had a large heritable component, Jensen also noted. The article fur¬ 
ther disclosed that the youngsters in the targeted populations were dis- 
proportionately black and that historically blacks as a population had 
exhibited average IQs substantially below those of whites. 

The reaction to Jensens article was immediate and violent. From 
1969 through the mid-1970s, dozens of books and hundreds of articles 
appeared denouncing the use of IQ tests and arguing that mental abil- 
ities are determined by environment, with the genes playing a minor 
role and race none at all. Jensen’s name became synonymous with a con- 
stellation of hateful ways of thinking. u It perhaps is impossible to exag- 
gerate the importance of the Jensen disgrace,” wrote Jerry Hirsch, a 
psychologist specializing in the genetics of animal behavior who was 
among Jensens more vehement critics. “It has permeated both Science 
and the universities and hoodwinked large segments of government and 
society. Like Vietnam and Watergate, it is a Contemporary Symptom of 
serious affliction.” 22 The title of Hirsch’s article was “The Bankruptcy 
of ‘Science* Without Scholarship.” During the first few years after the 
Harvard Educational Review article was published, Jensen could appear 
in no public forum in the United States without triggering something 
perilously close to a riot. 
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The uproar was exacerbated by William Shockley, who had won the 
Nobel Prize in physics for his contributions to the Invention of the trän- 
sistor but had turned his attention to human Variation toward the end 
of his career. As eccentric as he was brilliant, he often recalled the eu- 
genicists of the early decades of the Century. He proposed, as a “thought 
exercise,” a scheme for paying people with low IQs to be sterilized. 23 He 
supported (and contributed to) a sperm bank for geniuses. He seemed 
to relish expressing sensitive scientific findings in a way that would out¬ 
rage or disturb as many people as possible. Jensen and Shockley, utterly 
unlike as they were in most respects, soon came to be classed together 
as a pair of racist intellectual cranks. 

Then one of us, Richard Hermstein, an experimental psychologist at 
Harvard, strayed into forbidden territory with an article in the Sep¬ 
tember 1971 Atlantic Monthly . 24 Herrnstein barely mentioned race, but 
he did talk about heritability of IQ. His proposition, put in the form of 
a syllogism, was that because IQ is substantially heritable, because eco¬ 
nomic success in life depends in part on the talents measured by IQ tests, 
and because social Standing depends in part on economic success, it fol- 
lows that social Standing is bound to be based to some extent on inher- 
ited differences. By 1971, this had become a controversial thing to say. 
In media accounts of intelligence, the names jensen, Shockley, and 
Hermstein became roughly interchangeable. 

That same year, 1971, the U.S. Supreme Court outlawed the use of 
standardized abiiity tests by employets unless they had a “manifest rela- 
tionship” to the specific job in question because, the Supreme Court 
held, standardized tests acted as “built-in headwinds” for minority groups, 
even in the absence of discriminatory intent. 25 A year later, the National 
Education Association called upon the nation’s schools to impose a 
moratorium on all standardized intelligence testing, hypothesizing that 
“a third or more of American citizens are intellectually folded, mutilated 
or spindled before they have a chance to get through elementary school 
because of linguistically or culturally biased standardized tests.” 26 A 
movement that had begun in the 1960s gained momentum in the early 
1970s, as major school Systems throughout the country, including those 
of Chicago, New York, and Los Angeles, limited or banned the use of 
group-administered standardized tests in public schools. A number of Col¬ 
leges announced that they would no longer require the Scholastic Apti- 
tude Test as part of the admissions process. The legal movement against 


tests reached its apogee in 1978 in the case of Larry P. Judge Robert Peck- 
ham of the U.S. District Court in San Francisco ruled that it was un- 
constitutional to use IQ tests for placement of children in classes for the 
educably mentally retarded if the use of those tests resulted in placement 
of “grossly disproportionate” numbers of black children. 27 

Meanwhile, the intellectual debate had taken anew and personalized 
tum. Those who claimed that intelligence was substantially inherited 
were not just wrong, the critics now discovered, they were charlatans as 
well. Leon Kamin, a psychologist then at Princeton, opened this phase 
of the debate with a 1974 book, The Science and Politics of IQ. Patrio- 
tism, we have been told, is the last refuge of scoundrels,” Kamin wrote 
in the opening pages. “Psychologists and biologists might consider the 
possibility that heritability is the first.” 28 Kamin went on to charge that 
mental testing and belief in the heritability of IQ in particular had been 
fostered by people with right-wing political views and racist social views. 
They had engaged in pseudoscience, he wrote, suppressing the data they 
did not like and exaggerating the data that agreed with their precon- 
ceptions. Examined carefully, the case for the heritability of IQ was nil, 
concluded Kamin. 

In 1976, a British journalist, Oliver Gillie, published an article in the 
London Sunda;y Times that seemed to confirm Kamin’s thesis with a sen- 
sational revelation: The recently deceased Cyril Burt, Britain s most em- 
inent psychometrician, author of the largest and most famous study of 
the intelligence of identical twins who grew up apart, was charged with 
fraud. 29 He had made up data, fudged his results, and invented coau- 
thors, the Sunddy Times declared. The subsequent scandal was as big as 
the Piltdown Man hoax. Cyril Burt had not been just another researcher 
but one of the giants of twentieth-century psychology. Nor could his 
colleagues find a ready defense (the defense came later, as described in 
the box). They protested that the revelations did not compromise the 
great bulk of the work that bore on the issue of heritability, but their de- 
fenses sounded feeble in the light of the suspicions that had preceded 
Burt’s exposure. 

For the public observing the uproar in the academy from the side- 
lines, the capstone of the assault on the integrity of the discipline oc- 
curred in 1981 when Harvard paleobiologist Stephen Jay Gould, author 
of several populär books on biology, published The M ismeasure of Man. 
Gould examined the history of intelligence testing, found that it was 




12 Introducüon 


lntroduction 13 


The Burt Affair 


It would be more than a decade before the Burt affair was subjected to de- 
tailed reexamination. In 1989 and 1991, two accounts of the Burt allega- 
tions, by psychologist Robert Joynson and sociologist Ronald Fletcher, 
written independently, concluded that the attacks against Burt had been 
motivated by a mixture of Professional and ideological antagonism and that 
no credible case of data falsificationor fictitious research or researchers bad 
ever been presented. 30 Botb authors also concluded that some of Burt’s 
leading critics were aware that their accusations were inaccurate even at 
the time they made them. An ironic afterword centers on Burt’s claim that 
the correlation between the IQs of identical twins reared apart is +.77. A 
correlation this large almost irrefutably Supports a large genetic influence 
on IQ. Since the attacks on Burt began, it had been savagely derided as 
fraudulent, the product of Burt’s fiddling with the data to make his case. 
In 1990, the Minnesota twin study, accepted by most scholars as a model 
of its kind, produced its most detailed estimates of the correlation of IQ 
between identical twins reared apart. The procedure that most closely par- 
alleled Burt’s yielded a correlation of + .78. 31 


peopled by charlatans, racists, and self-deluded fools, and concluded 
that “determinist arguments for ranking people according to a single 
scale of intelligence, no matter how numerically sophisticated, have 
recorded llttle more than social prejudice.” 33 The Mismeasure of Man 
became a best-seller and won the National Book Critics Circle 
Award. 

Gould and his allies had won the visible battle. By the early 1980s, a 
new received wisdom about intelligence had been formed that went 
roughly as follows: 

Intelligence is a bankrupt concept . Whatever it might mean—and nobody 
really knows even how to define it—intelligence is so ephemeral that no one 
can measure it accurately. IQ tests are, ofcourse, culturally biased, and 
so are all the other “aptitude” tests, such as the SAT. To the extent that 
tests such as IQ and SAT measure anything, it certainly is not an innate 
“ intelligence . ” IQ scores are not constant; they often change significantly 
over an individual’s life span . The scores ofentire populations can be ex- 
pected to change over time—look at the Jews , who early in the twentieth 
Century scored below average on IQ scores and now score well above the 


average. Furthermore, the tests are nearly useless as toob, as confirmed 
by the welldocumented fact that such tests do not predict anything except 
success in schooi Eamings, occupation, productivity—all the important 
measures of success —are unrelated to the test scores . All that tests really 
accomplish is to label youngsters, stigmatizing the ones who do not do well 
and creating a self-fulfüling prophecy that injures the socioeconomically dis- 
advantaged in general and blacks in particular. 


INTELLIGENCE REDUX 

As far as public discussion is concerned, this collection of beliefs, with 
some variations, remains the state of wisdom about cognitive abilities 
and IQ tests. It bears almost no relation to the current state of knowl- 
edge among scholars in the field, however, and therein lies a tale. The 
dialogue about testing has been conducted at two levels during the last 
two decades—the visible one played out in the press and the subter- 
ranean one played out in the technical joumals and books. 

The case of Arthur Jensen is illustrative. To the public, he surfaced 
briefly, published an article that was discredited, and feil back into ob- 
scurity. Within the world of psychometrics, however, he continued to 
be one of the profession’s most prolific scholars, respected for his metic- 
ulous research by colleagues of every theoretical stripe. Jensen had not 
recanted. He continued to build on the same empirical findings that had 
gotten hün into such trouble in the 1960s, but primarily in technical 
publications, where no one outside the profession had to notice. The 
same thing was happening throughout psychometrics. ln the 1970s, 
scholars observed that colleagues who tried to say publicly that IQ tests 
had ment, or that intelligence was substantially inherited, or even that 
intelligence existed as a definable and measurable human quality, paid 
too high a price. Their careers, family lives, relationships with cob 
leagues, and even physical safety could be jeopardized by speaking out. 
Why speak out when there was no compelling reason to do so? Research 
on cognitive abilities continued to flourish, but only in the sanctuary of 
the ivory tower. 

In this cloistered environment, the continuing debate about intelli¬ 
gence was conducted much as debates are conducted within any other 
academic discipline. The public controversy had surfaced some genuine 
issues, and the competing parties set about trying to resolve them. Con- 
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troversial hypotheses were put to the test. Sometimes they were coiv 
firmed > sometimes rejected. Often they led to new questions, which were 
then explored. Substantial progress was made. Many of the issues that 
created such a public furor in the 1970s were resolved, and the study of 
cognitive abilities went on to explore new areas. 

This is not to say that controversy has ended, only that the contn> 
versy within the Professional Intelligence testing community is much 
different from that outside it. The issues that seem most salient in arti- 
cles in the populär press (Isn’t intelligence determined mostly by envi- 
ronment? Aren’t the tests useless because they’re biased?) are not major 
topics of debate within the profession, On many of the publicly discussed 
questions, a scholarly consensus has been reached. 34 Rather, the con- 
tending parties within the Professional community divide along other 
lines. By the early 1990s, they could be roughly divided into three fac- 
tions for our purposes: the classicists, the revisionists, and the radicals. 

The Classicists: Intelligence as a Structure 

The classicists work within the tradition begun by Spearman, seeking 
to identify the components of intelligence much as physicists seek to 
identify the structure of the atom. As of the 1990s, the classicists are for 
practical purposes unanimous in accepting that g sits at the center of 
the structure in a dominating position—not just as an artifact of Statis¬ 
tical manipulation but as an expression of a core human mental ability 
much like the ability Spearman identified at the turn of the Century. In 
their view, g is one of the most thoroughly demonstrated entities in the 
behavioral Sciences and one of the most powerful for understanding so- 
cially significant human Variation. 

The classicists took a long time to reach this level of consensus. The 
ink on Spearman’s first article on the topic in 1904 was barely dry be- 
fore others were arguing that intellectual ability could not be adequately 
captured by g or by any other unitary quantity—and understandably so, 
for common sense rebels against the idea that something so important 
about people as their intellects can be captured even roughly by varia- 
tions in a single quantity. Many of the famous names in the history of 
psychometrics challenged the reality of g } starting with Galton’s most 
eminent early disciple, Karl Pearson, and continuing with many other 
Creative and influential psychometricians. 

In diverse ways, they sought the grail of a set of primary and mutu- 
ally independent mental abilities. For Spearman, there was just one such 


primary ability, g. For Raymond Catteil, there are two kinds of g, crys- 
tallized and fluid , with crystallized g betng general intelligence trans- 
formed into the skills of one’s own culture, and fluid g being the 
all-purpose intellectual capacity from which the crystallized skills are 
formed. ln Louis Thurs tone’s theory of intelligence, there are a half- 
dozen or so primary mental abilities , such as verbal, quantitative, spatial, 
and the like. In Philip Vernon’s theory, intellectual capacities are 
arranged in a hierarchy with g at its apex; in Joy Guilford’s, the struc¬ 
ture of intellect is refined into 120 or more intellectual components. 
The theoretical alternatives to unitary, general intelligence have come 
in many sizes, shapes, and degrees of plausibility. 

Many of these efforts proved to have lasting value. For example, Cat- 
tell’s distinction between fluid and crystallized intelligence remains a 
useful conceptual contrast, just as other work has done much to clarify 
what lies in the domain of specific abilities that g cannot account for, 
But no one has been able to devise a set of tests that do not reveal a 
large general factor of intellectual ability—in other words, something 
very like Spearmans g. Furthermore, the classicists point out, the best 
standardized tests, such as a modern IQ test, do a reasonably good job 
of measuring g. When properly administered, the tests are not measur- 
ably biased against socioeconomic, ethnic, or racial subgroups. They 
predict a wide variety of socially important outcomes. 

This is not the sarne as saying that the classicists are satisfied with 
their understanding of intelligence. g is a Statistical entity, and current 
research is probing the underlying neurologic basisfor it. Arthur jensen, 
the archetypal classicist, has been active in this effort for the last decade, 
returning to Galton’s intuition that performance on elementary cogni¬ 
tive tasks, such as reaction time in recognizing simple pattems of lights 
and shapes, provides an entry point into understanding the physiology 
ofg. 

The Revisioyiists: Intelligence as Information Processing 

A theory of intelligence need not be structural. The emphasis may be 
on process rather than on structure. In other words, it may try to figure 
out what a person is doing when exercising his or her intelligence, rather 
than what elements of intelligence are put together. The great Swiss 
psychologist, ]ean Piaget, started his career in Alfred Binets lahoratoiy 
trying to adapt Cyril Burt’s intelligence tests for Parisian children. Piaget 
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discovered quickly that he was less interested in how well the children 
did than in what errors they made. [35] Errors revealed what the underly- 
mg processes of thought must have been, Piaget believed. It was the 
processes of Intelligence that fascinated him during his long and illus- 
trious career, which led in time to his theory of the stages of cognitive 
development. 

Starting in the 1960s, research on human cognition became the pre- 
occupation of experimental psychologists, displacing the animal learn- 
ing experiments of the earlier period. It was inevitable that the new 
experimentalists would turn to the study of human intelligence in nat- 
ural settings. John B. Carroll and Earl B. Hunt led the way from the cog¬ 
nition laboratory to the study of human intelligence in everyday life. 

Today Yale psychologist Robert Sternberg is among the leaders of this 
development. 

The revisionists share much with the classicists. They accept that a 
general mental ability much like Spearman’s g has to be incorporated 
mto any theory of the structure of intelligence, although they would not 
agree that it accounts for as much of the intellectual Variation among 
people as many classicists claim. They use many of the same Statistical 
tools as the classicists and are prepared to subject their work to the same 
Standards of rigor. Where they differ with the classicists, however, is 

their attitude toward intellectual structure and the tests used to mea- 
sure it. 

Yes, the revisionists argue, human intelligence has a structure, but is 
it worth investing all that effort in discovering what it is? The preoc- 
cupation with structure has engendered preoccupation with summary 
scores, the revisionists say. That, after all, is what an IQ score represents: 
a composite of scores that individually measure quite distinct intellec¬ 
tual processes. “Ofcourse,” Sternberg writes, “a tester can always aver¬ 
age over multiple scores. But are such averages revealing, or do they 
Camouflage more than they reveal? If a person is a wonderful visualizer 
but can barely compose a sentence, and another person can write glow- 
mg prose but cannot begin to visualize the simplest spatial Images, what 

do you really leam about these two people if they are reported to have 
the same IQ?” 36 

By focusing on processes, the revisionists argue, they are working 
richer veins than are those who search for static structure. What really 
counts about intelligence are the ways in which people process the In¬ 
formation they receive. What problem-solving mechanisms do they em- 


ploy? How do they trade off speed and accuracy? How do they combine 
different problem-solving resources into a strategy? Stemberg has fash- 
ioned his own thinking on this topic into what he calls a “triarchy of 
intelligence,” or “three aspects of human information processing.” 37 

The first part of Sternberg’s triarchy attempts to describe the inter¬ 
nal architecture of intellectual functioning, the means by which hu- 
mans translate sensory inputs into mental representations, allocate 
mental resources, infer conclusions from raw material, and acquire skills. 
This architectural component of Sternberg’s theory bears a family re- 
semblance to the classicists’ view of the dimensions of intelligence, but 
it emphasizes process over structure. 

The second part of the triarchic theory addresses the role of intelli¬ 
gence in routinizing performance, starting with completely novel tasks 
that test a person’s insightfulness, flexibility, and creativity, and even- 
tually converting them to routine tasks that can be done without con- 
scious thought. Understand this process, Sternberg argues, and we have 
leverage not just for measuring intelligence but for improving it. 

The third part of Sternberg’s triarchy attacks the question that has 
been central to the controversy over intelligence tests: the relationship 
of intelligence to the real world in which people function. In Sternberg’s 
view, people function by means of three mechanisms: adaptation 
(roughly, trying to make the best of the Situation), shaping the external 
environment so that it conforms more closely to the desired state of af- 
fairs, or selecting a new environment altogether. Sternberg laments the 
inadequacies of traditional intelligence tests in capturing this real-world 
aspect of intelligence and seeks to develop tests that will do so—and, 
in addition, lead to techniques for teaching people to raise their intel¬ 
ligence. 

The Radicals: The Theory of Multiple Intelligences 

Walter Lippmann’s hostility toward intelligence testing was grounded 
in his belief that this most important of all human qualities was too di¬ 
verse, too complex, too changeable, too dependent on cultural context, 
and, above all, too subjective to be measured by answers to a mere list 
of test questions. Intelligence seemed to him, as it does to many other 
thoughtful people who are not themselves expert in testing, more like 
beauty or justice than height or weight. Before something can be mea¬ 
sured, it must be defined, this argument goes. 38 And the problems of defi- 
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nition for beauty, justice, or Intelligence are insuperable. To people who 
hold these views, the claims of the intelligence testers seem naive at 
best and vicious at worst. These views, which are generally advanced 
primarily by nonspecialists, have found an influential spokesman from 
the academy, which is mainly why we include them here. We refer here 
to the theory of multiple intelligences formulated by Howard Gardner, 
a Harvard psychologist. 

Gardner’s general definition of intelligent behavior does not seem 
radical at all. For Gardner, as for many other thinkers on intelligence, 
the notion of problem solving is central. “A human intellectual com- 
petence must entail a set of skills of problem solving,” he writes, a en- 
abling the individual to resolve genuine Problems or difficulties that he or 

she encounters and, when appropriate, to create an effective product_ 

and also must entail the potential for finding or creating problem— 
thereby laying the groundwork for the acquisition of new knowledge.” 3g 

Gardner’s view is radical (a word he uses himself to describe his the¬ 
ory) in that he rejects, virtually without qualification, the notion of a 
general intelligence factor, which is to say that he denies g. Instead, he 
argues the case for seven distinct intelligences: linguistic, musical, 
logical-mathematical, spatial, bodily-kinesthetic, and two forms of "per¬ 
sonal intelligence,” the intrapersonal and the interpersonal, each based 
on its own unique computational capacity. 40 Gardner rejects the criti- 
cism that he has merely redefined the word intelligence by broadening it 
to include what may more properly be called talents: "I place no par- 
ticular premium on the word intelligence , but I do place great importance 
on the equivalence of various human faculties,” he writes. "If critics [of 
his theory] were willing to label language and logical thinking as talents 
as well, and to remove these from the pedestal they currently occupy, 
then I would be happy to speak of multiple talents.” 41 

Gardner’s approach is also radical in that he does not defend his the¬ 
ory with quantitative data. He draws on findings from anthropology to 
zoology in his narrative, but, in a field that has been tntensely quanti¬ 
tative since its inception, Gardner’s work is uniquely devoid of psycho- 
metric or other quantitative evidence. He dismisses factor analysis: 
[G]iven the same set of data, it is possible, using one set of factor- 
analytic procedures, to come up with a picture that Supports the idea of 
a ( g’ factor; using another equally valid method of Statistical analysis, it 
is possible to support the notion of a family of relatively discrete men¬ 
tal abilities.” 1421 He is untroubled by the fact that tests of the varying in¬ 


telligences in his theory seem to be intercorrelated: "I fear. . . that I can- 
not accept these correlations at face value. Nearly all current tests are 
so devised that they call principally upon linguistic and logical facil- 
ity. . . . Accordingly, individuals with these skills are likely to do well 
even in tests of musical or spatial abilities, while those who are not es- 
pecially facile linguistically and logically are likely to be impaled on such 
standardized tests.” 43 And in general, he invites his readers to disregard 
the thorny complexities of the classical and revisionist approaches: 
"When it comes to the interpretation of intelligence testing, we are 
faced with an issue of taste or preference rather than one on which sci¬ 
entific closure is likely to be reached.” 44 

THE PERSPECTIVE OF THIS BOOK 

Given these different ways of understanding intelligence, you will nat- 
urally ask where our sympathies lie and how they shape this book. 

We will be drawing most heavily from the classical tradition. That 
body of scholarship represents an immense and rigorously analyzed body 
of knowledge. By accepted Standards of what constitutes scientific evi¬ 
dence and scientific proof, the classical tradition has in our view given 
the world a treasure of information that has been largely ignored in try- 
ing to understand Contemporary policy issues. Moreover, because our 
topic is the relationship of human abilities to public policy, we will be 
dealing in relationships that are based on aggregated data, which is 
where the classical tradition has the most to offer. Perhaps an example 
will illustrate what we mean. 

Suppose that the question at issue regards individuals: "Given two 11 
year olds, one with an IQ of 110 and one with an IQ of 90, what can 
you teil us about the differences between those two children?” The an- 
swer must be phrased very tentatively. On many important topics, the 
answer must be, "We can teil you nothing with any confidence.” It is 
well worth a guidance counselor’s time to know what these individual 
scores are, but only in combination with a variety of other information 
about the child’s personality, talents, and background. The individuals 
IQ score all by itself is a useful tool but a limited one. 

Suppose instead that the question at issue is: "Given two sixth-grade 
classes, one for which the average IQ is 110 and the other for which it 
is 90, what can you teil us about the difference between those two classes 
and their average prospects for the future?” Now there is a great deal to 
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be said, and it can be said with considerable confidence'—not about any 
one person in either dass but about average outcomes that are impor¬ 
tant to the school, educational policy in general, and society writ large. 
The data accumulated under the classical tradition are extreme ly rieh 
in this regard, as will become evident in subsequent chapters. 

If instead we were more concemed with the development of cogni- 
tive processes than with aggregate social and economic outcomes, we 
would correspondingly spend more time discussing the work of the re- 
visionists. That we do not reflects our focus, not a dismissal of their work. 

With regard to the radicals and the theory of multiple intelligences, 
we share some common ground. Socially significant individual differ- 
ences include a wide ränge of human talents that do not fit within the 
classical conception of intelligence. For certain spheres of life, they mat¬ 
ter profoundly. And even beyond intelligence and talents, people vary 
temperamentally, in personality, style, and character. But we confess to 
reservations about using the word intelligence to describe such factors as 
musical abilities, kinesthetic abilities, or personal skills. It is easy to un- 
derstand how intelligence (ordinarily understood) is part of some as- 
pects of each of those human qualities—obviously, Bach was engaging 
in intelligent activity, and so was Ted Williams, and so is a good used- 
car salesman—but the part intelligence plays in these activities is cap- 
tured fairly well by intelligence as the classicists and revisionists 
conceive of it. In the case of music and kinesthetics, talent is a word with 
a domain and weight of its own, and we are unclear why we gain any- 
thing by discarding it in favor of another word, intelligence, that has had 
another domain and weight. In the case of intrapersonal and interper¬ 
sonal skills, conventional intelligence may play some role, and, to the 
extent that other human qualities matter, words like sensitivity, charm, 
persuasiveness , insight —the list could go on and on—have accumulated 
over the centuries to describe them. We lose precision by using the word 
intelligence to cover them all. Similarly, the effect that an artist or an 
athlete or a salesman creates is complex, with some aspects that may be 
dominated by specific endowments or capacities, others that may be the 
product of learned technique, others that may be linked to desires and 
drives, and still others that are characteristic of the kind of cognitive 
ability denoted by intelligence, Why try to make intelligence do triple or 
quadruple duty? 

We agree emphatically with Howard Gardner, however, that the con- 
cept of intelligence has taken on a much higher place in the pantheon 


of human virtues than it deserves. One of the most insidious but also 
widespread errors regarding IQ, especially among people who have high 
IQs, is the assumption that another person’s intelligence can be inferred 
from casual interactions. Many people conclude that if they see some- 
one who is sensitive, humorous, and talks fluently, the person must 
surely have an ahove-average IQ. 

This identification of IQ with attractive human qualities in general 
is unfortunate and wrong. Statistically, there is often a modest correla- 
tion with such qualities. But modest correlations are of little use in siz- 
ing up other individuals one by one. For example, a person can have a 
terrific sense of humor without giving you a clue about where he is 
within thirty points on the IQ scale. Or a plumber with a measured IQ 
of 100—only an average IQ—can know a great deal about the func- 
tioning of plumhing Systems. He may be able to diagnose probiems, dis- 
cuss them articulately, make shrewd decisions about how to fix them, 
and, vvhile he is working, make some pithy remarks about the presidenbs 
recent speech. 

At the same time, high intelligence has earmarks that correspond to 
a first approximation to the commonly understood meaning of smart. In 
our experience, people do not use smart to mean (necessarily) that a per¬ 
son is prudent or knowledgeable but rather to refer to qualities of men¬ 
tal quickness and complexity that do in fact show up in high test scores. 
To return to our examples: Many witty people do not have unusually 
high test scores, but someone who regularly tosses off impromptu com¬ 
plex puns probahly does (which does not necessarily mean that such 
puns are very funny, we hasten to add). If the plumber runs into a Prob¬ 
lem he has never seen before and diagnoses its source through inferences 
from what he does know, he probahly has an IQ of more than 100 after 
all. In this, language tends to reflect real differences: ln everyday lan- 
guage, people who are called very smart tend to have high IQs. 

All of this is another way of making a point so important that we will 
italicize it now and repeat elsewhere: Measures of intelligence have reli- 
able Statistical rehtionships with important social phenomena , but they are a 
limited tool for deciding what to make of any given individual Repeat it we 
must, for one of the probiems of writing about intelligence is how to re- 
mind readers often enough how little an IQ score teils about whether 
the human being next to you is someone whom you will admire or eher- 
vsh. This thing we know as IQ is important but not a synonym for hu¬ 
man excellence. 
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Idiot Savants and Other Anomalies 


To add one final complication, it is also known that some people with low 
measured IQ occasionally engage in highly developed, complex cognitive 
tasks. So-called idiot savants can (for example) teil you on what day Easter 
occurred in any of the past or future two thousand years. 1451 There are also 
many less exotic examples. For example, a study of successful track bettors 
revealed that some of them who used extremely complicated betting Sys¬ 
tems had below-average IQs and that IQ was not correlated with success. 46 
The trick in interpreting such results is to keep separate two questions: (1) 
If one selects people who have already demonstrated an obsession and suc- 
cess with racetrack betting Systems, will one find a relationship with IQ 
(the topic of the study in question)? versus (2) if one selects a thousand 
people at random and asks them to develop racetrack betting Systems, will 
there be a relationship with IQ (in broad terms, the topic of this book)? 


Howard Gardner has also convinced us that the word Intelligence car¬ 
ries with it undue affect and political baggage. It is still a useful word, 
but we shall subsequently employ the more neutral term cognitive ability 
as often as possible to refer to the concept that we have hitherto called 
intelligence^ just as we will use IQ as a generic synonym for intelligence test 
score . Since cognitive ability is an uneuphonious phrase, we lapse often 
so as to make the text readable. But at least we hope that it will help 
you think of intelligence as just a noun, not an accolade. 

We have said that we will be drawing most heavily on data from the 
classical tradition. That implies that we also accept certain conclusions 
undergirding that tradition. To draw the Strands of our perspective to- 
gether and to set the stage for the rest of the book, let us set them down 
explicitly. Here are six conclusions regarding tests of cognitive ability, 
drawn from the classical tradition, that are by now beyond significant 
technical dispute: 

1. There is such a thing as a general factor of cognitive ability on which 
human beings differ. 

2. All standardized tests of academic aptitude or achievement measure 
this general factor to some degree, but IQ tests expressly designed for 
that purpose measure it most accurately. 

3. IQ scores match, to a first degree, whatever it is that people mean 
when they use the word intelligent or smart in ordinary language. 


4. IQ scores are stable, although not perfectly so, over much of a per- 
son’s life. 

5. Properly administered IQ tests are not demonstrably biased against 
social, economic, ethnic, or racial groups. 

6. Cognitive ability is substantially heritable, apparently no less than 
40 percent and no more than 80 percent. 

All six points have an inverse worth noting. For example, some 
people’s scores change a lot; cognitive ability is not synonymous with 
test scores or with a single general mental factor, and so on. When we 
say that all are “beyond significant technical dispute/’ we mean, in 
effect, that if you gathered the top experts on testing and cog¬ 
nitive ability, drawn from all points of view, to argue over these 
points, away from television cameras and reporters, it wouid quickly 
become apparent that a consensus already exists on all of the points, 
in some cases amounting to near unanimity, And although dispute 
wouid ensue about some of the points, one side—the side repre- 
sented by the way the points are stated—wouid have a clear pre- 
ponderance of evidence favoring it, and those of another viewpoint 
wouid be forced to lean heavily on isolated studies showing anom¬ 
alous results. 

This does not mean that the experts should leave the room with their 
differences resolved. All six points can be accurate as general rules and 
still leave room for differences in the theoretical and practical conclu¬ 
sions that people of different values and perspectives draw from them 
(and from the mass of material about cognitive ability and testing not 
incorporated in the six points). Radicals in the Gardner mold might still 
balk at all the attention being paid to intelligence as the tests measure 
it. But these points, in themselves, are squarely in the middle of the sci¬ 
entific road. 

Having said this, however, we are left with a dilemma. The received 
wisdom in the media is roughly 180 degrees opposite from each of the 
six points. To prove our case, taking each point and amassing a full ac- 
count of the evidence for and against, wouid lead us to write a book just 
about them. Such books have already been written. There is no point 
in our trying to duplicate them. [47] 

We have taken two Steps to help you form your own judgments within 
the limits of this book. First, we deal with specific issues involving the 
six points as they arise in the natural course of the discussion—cultural 
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bias when discussing differences in scores across ethnic groups, for ex- 
ample. Second, we try to provide a level of detail that will satisfy dif¬ 
ferent levels of technical curiosity through the use of boxed material 
(you have already come across some examples), notes, and appendixes. 
Because we expect (and fear) that many readers will go directly to chap- 
ters that especially internst them rather than read the book from cover 
to cover, we also insert periodic reminders about where discussion of 
certain key topics may be found. 


PARTI 

The Emergence of a 
Cognitive Elite 

The twentieth Century dawned on a world segregated into social classes 
defined in terms of money, power, and Status. The andern lines of Sep¬ 
aration based on hereditary rank were being erased, replaced by a more 
complicated set of overlapping lines. Social Standing still played a ma- 
jor role, if less often accompanied by a sword or tiara, but so did out- 
and-out wealth, educational credentials, and, increasingly, talent. 

Our thesis is that the twentieth Century has continued the transfor- 
mation, so that the twenty-first will open on a world in which cogni¬ 
tive ability is the decisive dividing force. The shift is more subtle than 
the previous one but more momentous. Social dass remains the vehi- 
cle of social life, but intelligence now pulls the train. 

Cognitive stratification takes different forms at the top and the bot- 
tom of the scale of intelligence. Part II will look at the bottom. In Part 
I, we look at the top. Its story line is that modern societies identify the 
brightest youths with ever increasing efficiency and then guide them 
into fairly narrow educational and occupational channels. These chan- 
nels are increasingly lucrative and influential, leading to the develop¬ 
ment of a distinct Stratum in the social hierarchy, which we hereby dub 
the Cognitive Elite. The isolation of the brightest from the rest of soci- 
ety is already extreme; the forces driving it are growing stronger rather 
than weaker. Governments can influence these forces but cannot neu- 
tralize them. 

This does not mean that a member of the cognitive elite never crosses 
paths with a person with a low IQ, but the encounters that matter tend 
to be limited. The more intimate or more enduring the human rela- 
tionship is, the more likely it is to be among people similar in intellec- 
tual level. That the brightest are identified has its benefits. That they 
become so isolated and inbred has its costs. Some of these costs are al¬ 
ready visible in American society, while others lie over the horizon. 

25 
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Human society has always had some measure of cognitive stratifica¬ 
tion. The best hunters among the Bushmen of the Kalahari tend to score 
above the average of their tribe on modern intelligence tests and so, 
doubtless, would have the chief ministers in Cheop’s Egypt. 1 The Man¬ 
darins who ran China for centuries were chosen by examinations that 
tested for understanding of the Confucian classics and, in so doing, 
screened for intelligence. The priests and monks of medieval Europe, 
recruited and self-selected for reasons correlated with cognitive ability, 
must have been brighter than average. 

This differentiation by cognitive ability did not coalesce into cogni¬ 
tive classes in premodem societies for various reasons. Clerical celibacy 
was one. Another was that the people who rose to the top on their brains 
were co-opted by aristocratic Systems that depleted their descendants’ 
talent, mainly through the mechanism known as primogeniture. Be- 
cause parents could not pick the brightest of their progeny to inherit 
the title and land, aristocracies feil victim to regression to the mean: 
children of parents with above-average IQs tend to have lower IQs than 
their parents, and their children’s IQs are lower still. Over the course of 
a few generations, the average intelligence in an aristocratic family feil 
toward the population average, hastened by marriages that matched 
bride and groom by lineage, not ability. 

On the other hand, aristocratic societies were not as impermeable to 
social mobility as they tried to be. They allowed at least some avenues 
for ability to rise toward the top, whereupon the brains of the newcomer 
were swapped in marriage for family Connections and titles. England was 
notably sagacious in this regard, steadily infusing new talent into the 
aristocracy by creating peerages for its most successful Commoners. The 
traditional occupations for the younger sons of British peers—army, 
navy, church, and the administration of the empire—gave the ablest 
younger sons in the aristocracy a good chance to rise to the top and help 
sustain the System. Indeed, the success of some English families in sus- 
taining their distinction over several generations was one of the factors 
that prompted Francis Galton to hypothesize that intelligence was in- 
herited. But only a minority of aristocratic families managed this trick. 
It remained true even in England that, after a few generations, the 
holder of any given aristocratic title was unlikely to be smarter than any- 
one eise. When one observer wrote of the aristocracy in Queen Victo- 
ria’s day that “all the social talk is stupid and insipid,” he was being more 
accurate than oerhans he realized. 2 


Even in less rigidly stratified societies, stratification by cognitive abil¬ 
ity has been weak and inconsistent until this Century because the num- 
ber of very bright people was so much greater than the specialized jobs 
for which high intelligence is indispensable. A true cognitive elite re- 
quires a technological society. This raises a distinction that is so impor¬ 
tant, and forgetting it can so easily lead to needless misunderstanding, 
that it is worth emphasizing: To say that most of the people in the cognP 
tively demanding pos/tions of a society have a high IQ is not the same as say- 
ing that most of the people with high IQs are in such positions. It is possible 
to have cognitive screening without having cognitive classes. Mathe- 
matical necessity teils us that a large majority of the smart people in 
Cheop’s Egypt, dynastic China, Elizabethan England, and Teddy Roo- 
sevelt’s America were engaged in ordinary pursuits, mingling, working, 
and living with everyone eise. Many were housewives. Most of the rest 
were farmers, smiths, millers, bakers, carpenters, and shopkeepers. So¬ 
cial and economic stratification was extreme, but cognitive stratifica¬ 
tion was minor. 

So it has been from the beginning of history into this Century. Then, 
comparatively rapidly, a new dass structure emerged in which it became 
much more consistently and universally advantageous to be smart. In 
the next four chapters, we examine that process and its meaning. 


Chapter 1 


Cognitive Class and Education, 

1900-1990 


In the course of the twentieth Century, America opened the doors ofits Col¬ 
leges wider than any previous generation ofAmericans, or other society in his- 
tory, could have imagined possible. This democratization of higher education 
has raised new barriers between people that may prove to be more divisive and 
intractablc than the old ones. 

The growth in the Proportion of people getting College degrees is the most 
obvious result, with a fifteen-fold increase from 1900 to 1990. Even more 
important, the students going to College were being selected ever more effi- 
ciently for their high IQ. The crucial decade was the 1950s, when the per- 
centage of top students who went to College rose by more than it had in the 
Iireceding three decades. By the beginning of the 1990s, about 80 percent of 
all students m the top quartile ofability continued to College after high school 
Among the high school graduates in the top few percentiles of cognitive abil- 
ity t the chances of going to College already exceeded 90 percent. 

Perhaps the most important of all the changes was the transformation of 
America’s elite Colleges . As more bright youngsters went offto College, the Col¬ 
leges themselves began to sort themselves out. Starting in the 1950s, a hand- 
ful of institutions became magnets for the very brightest ofeach years new 
class. In these schools, the cognitive level of the students rose far above the 
rest of the College population. 

Taken together, these trends have stratified America according to cognitive 
ability. 


A perusal of Harvard’s Freshman Register for 1952 shows a class look- 
ing very much as Harvard freshman classes had always looked. 
Under the photographs of the welhscrubbed, mostly East Coast, oven 
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whelmingly white and Christian young men were home addresses from 
places like Philadelphia^ Main Line, the Upper East Side of New York, 
and Boston’s Beacon Hill. A large proportion of the class came from a 
handful of America^ most exclusive boarding schools; Phillips Exeter 
and Phillips Andover alone contributed almost 10 percent of the fresh- 
men that year. 

And yet for all its apparent exclusivity, Harvard was not so hard to 
get into in the fall of 1952. An applicant’s chances of being admitted 
were about two out of three, and close to 90 percent if his father had 
gone to Harvard. 1 With this modest level of competition, it is not sur- 
prising to learn that the Harvard Student body was not uniformly bril- 
liant. In fact, the mean SAT-Verbal score of the incoming freshmen class 
was only 583, well above the national mean but nothing to brag about, 121 
Harvard men came from a ränge of ability that could be duplicated in 
the top half of many state universities. 

Let us advance the scene to 1960. Wilbur J. Bender, Harvard’s dean 
of admissions, was about to leave his post and trying to sum up for the 
board of overseers what had happened in the eight years of his tenure. 
“The figures,” he wrote, “report the greatest change in Harvard admis¬ 
sions, and thus in the Harvard Student body, in a short time—two Col¬ 
lege generations—in our recorded history.” 3 Unquestionably, suddenly, 
but for no obvious reason, Harvard had become a different kind of place. 
The proportion of the incoming students from New England had 
dropped by a third. Public school graduates now outnumbered private 
school graduates. Instead of rejecting a third of its applicants, Harvard 
was rejecting more than two-thirds—and the quality of those applicants 
had increased as well, so that many students who would have been ad¬ 
mitted in 1952 were not even bothering to apply in 1960, 

The SAT scores at Harvard had skyrocketed. In the fall of 1960, the 
average verbal score was 678 and the average math score was 695, an 
increase of almost a hundred points for each test. The average Harvard 
freshman in 1952 would have placed in the bottom 10 percent of the 
incoming class by 1960. ln eight years, Harvard had been transformed 
from a school primarily for the northeastem socioeconomic elite into a 
school populated by the brightest of the bright, drawn from all over the 
country. 

The story of higher education in the United States during the twenti- 
eth Century is generally taken to be one of the great American success 


stories, and with good reason. The record was not without blemishes, 
but the United States led the rest of the world in opening College to a 
mass population of young people of ability, regardless of race, color, 
creed, gender, and financial resources, 

But this success story also has a paradoxically shadowy side, for edu¬ 
cation is a powerful divider and classifier. Education affects income, and 
income divides. Education affects occupation, and occupations divide, 
Education affects tastes and interests, grammar and accent, all of which 
divide. When access to higher education is restricted by class, race, or 
religion, these divisions cut across cognitive levels. But school is in it- 
self, more immediately and directly than any other institution, the place 
where people of high cognitive ability excel and people of low cogni¬ 
tive ability fail. As America opened access to higher education, it 
opened up as well a revolution in the way that the American popula¬ 
tion sorted itself and divided itself. Three successively more efficient 
sorting processes were at work: the College population grew, it was re- 
cruited by cognitive ability more efficiently, and then it was further 
sorted among the Colleges. 


THE COLLEGE POPULATION GROWS 

A social and economic gap separated high school graduates from Col¬ 
lege graduates in 1900 as in 1990; that much is not new. But the social 
and economic gap was not accompanied by much of a cognitive gap, be- 
cause the vast majority of the brightest people in the United States had 
not gone to College. We may make that Statement despite the lack of 
IQ scores from 1900 for the same reason that we can make such State¬ 
ments about Elizabethan England: It is true by mathematical necessity. 
In 1900, only about 2 percent of 23-year-olds got College degrees. Even 
if all of the 2 percent who went to College had IQs of 115 and above 
(and they did not), seven out of eight of the brightest 23-year-olds in 
the America of 1900 would have been without College degrees. This Sit¬ 
uation barely changed for the first two decades of the new Century. Then, 
at the close of World War I, the role of College for American youths be- 
gan an expansion that would last until 1974, interrupted only by the 
Great Depression and World War II. 

The three lines in the figure show trends established in 1920-1929, 
1935-1940, and 1954-1973, then extrapolated. They are there to high- 
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In the twentieth Century, the prevalence of the College 
degree goes from one in fifty to a third of the population 


New bachelor’ s degrees as a percentage of 23-year-olds 



Sourccs: 1900-1959: U.S. Bureau of the Census 1975, H751-765. 1960-1992: DES, 1992, 
Table 229. 


light the three features of the figure worth noting. First, the long per¬ 
spective serves as a counterweight to the common belief that the Col¬ 
lege population exploded suddenly after World War II. It certainly 
exploded in the sense that the number of College students went from a 
wartime trough to record highs, but this is because two generations of 
College students were crowded onto campuses at one time. In terms of 
trendlines, World War II and its aftermath was a blip, albeit a large blip. 
When this anomalous turmoil ended in the mid-1950s, the proportion 
of people getting College degrees was no higher than would have been 
predicted from the trends established in the 1920s or the last half of the 
1930s (which are actually a single trend interrupted by the worst years 
of the depression). 

The second notable feature of the figure is the large upward tilt in 
the trendline from the mid-1950s until 1974. That it began when it 
did—the Eisenhower years—comes as a surprise. The Gl bill’s impact 
had faded and the postwar baby boom had not yet reached College age. 
Presumably postwar prosperity had something to do with it, but the ex- 
planation cannot be simple. The slope remained steep in periods as dif¬ 
ferent as Eisenhower’s late 1950s, LBJ’s mid-1960s, and Nixon’s early 
1970s. 


After 1974 came a peculiar plunge in College degrees that lasted un¬ 
til 1981—-peculiar because it occurred when the generosity of scholar- 
ships and loans, from Colleges, foundations, and government alike, was 
at its peak. This period of declining graduates was then followed by a 
steep increase from 1981 to 1990—also peculiar, in that College was be- 
coming harder to afford for middle-class Americans during those years. 
As of 1990, the proportion of students getting College degrees had more 
than made up for the losses during the 1970s and had established a new 
record, with B.A.s and B.S.s being awarded in such profusion that they 
amounted to 30 percent of the 23-year-old population. 


MAKING GOOD ON THE IDEAL OF OPPORTUNITY 

At first glance, we are telling a story of increasing democracy and in- 
termingling, not of stratification. Once upon a time, the College degree 
was the preserve of a tiny minority; now almost a third of each new co- 
hort of youths earns it. Surely, it would seem, this must mean that a 
broader ränge of people is going to College—including people with a 
broader, not narrower, ränge of cognitive ability. Not so. At the same 
time that many more young people were going to College, they were also 
heing selected ever more efficiently by cognitive ability. 

A Compilation of the studies conducted over the course of the Cen¬ 
tury suggests that the crucial decade was the 1950s. The next figure 
shows the data for the students in the top quartile (the top 25 percent) 
in ability and is based on the proportion of students entering College 
(though not necessarily finishing) in the year following graduation from 
high school. 

Again, the lines highlight trends set in particular periods, here 
1925-1950 and 1950-1960. From one period to the next, the propor¬ 
tion of bright students getting to College leaped to new heights. There 
are two qualifications regarding this figure. First, it is based on high 
school graduates —the only data available over this time period—and 
therefore drastically understates the magnitude of the real change from 
the 1920s to the 1960s and thereafter, because so many of the top quar- 
tile in ability never made it through high school early in the Century 
(see Chapter 6). It is impossible to be more precise with the available 
data, but a reasonable estimate is that as of the mid-1920s, only about 
15 percent of all of the nation’s youth in the top IQ quartile were going 
on to College. 141 It is further the case that almost all of those moving on 




34 The Emergence ofa Cognitive Elite 


Cognitive Class and Educadon, 1900-1990 35 


At mid-century America abruptly becomes more efficient in 
getting the top students to College 

High school graduates in the top IQ 
quartile who went directly to College 
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Between the 1920s and the 1960s, College attendance 
becomes much more closely pegged to IQ 
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to College in the 1920s were going to four-year Colleges, and this leads 
to the second qualification to keep in mind: By the 1970s and 1980s, 
substantial numbers ofthose shown as continuing to College were going 
to a junior College, which are on average less demanding than four-year 
Colleges. Interpreting all the available data, it appears that the Propor¬ 
tion of all American youth in the top IQ quartile who went directly to 
four-year Colleges rose from roughly one youth in seven in 1925 to about 
two out of seven in 1950 to more than four out of seven in the early 
1960s, where ithas remained, with perhaps ashallow upward trend, ever 
since, [5] 

But it is not just that the top quartile of talent has been more effi- 
ciently tapped for College, At every level of cognitive ability, the links 
between IQ and the probability of going to College became tighter and 
more regulär. The next figure summarizes three studies that pennit us to 
calculate the probability of going to College throughout the ability ränge 
over the last seventy years. Once again we are restricted to high school 


graduates for the 1925 data, which overstates the probability of going 
to College during this period. Even for the fortunate few who got a high 
school degree in 1925, high cognitive ability improved their chances of 
getting to College—but not by much. [61 The brightest high school grad¬ 
uates had almost a 60 percent chance of going to College, which means 
that they had more than a 40 percent chance of not going, despite hav- 
ing graduated from high school and being very bright. The chances of 
College for someone merely in the 80th percentile in ability were no 
greater than classmates who were at the 50th percentile, and only 
slightly greater than classmates in the bottom third of the dass. 

Between the 1920s and the 1960s, the largest change in the proba¬ 
bility of going to College was at the top of the cognitive ability distri- 
bution. By 1960, a Student who was really smart—at or near the lOOth 
percentile in IQ—had a chance of going to College of nearly 100 per¬ 
cent. 171 Furthermore, as the figure shows, going to College had gotten 
more dependent on intelligence at the bottom of the distribution, too. 8 
A Student at the 30th percentile had only about a 25 percent chance of 
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going to College '—lower than it had been for high school graduates in 
the 1920s. But a Student in the 80th percentile had a 70 percent chance 
of going to College, well above the proportion in the 1920s. 

The line for the early 1980s is based on students who graduated from 
high school between 1980 and 1982. The data are taken from the Na¬ 
tional Longitudinal Survey of Youth (NLSY), which will figure promi- 
nently in the chapters ahead. Briefly, the NLSY is a very large (originally 
12,686 persons), nationally representative sample of American youths 
who were aged 14 to 22 in 1979, when the study began, and have been 
followed ever since. (The NLSY is discussed more fully in the intro- 
duction to Part II.) The curve is virtually identical to that from the early 
1960s, which is in itself a finding of some significance in the light of the 
tnany upheavals that occurred in American education in the 1960s and 
1970s. 


Didn’t Equal Opportunity in Higher Education Really Open Up 
Düring the 1960s? 

The conventional wisdom holds that the revolution in higher education 
occurred in the last half of the 1960s, as part of the changes of the Great 
Society, especially its affirmative action policies. We note here that the 
proportion of youths going to College rose about as steeply in the 1950s as 
in the 1960s, as shown in the opening figure in this chapter and the ac- 
companying discussion. Chapter 19 considers the role played by affirma¬ 
tive action in the changing College population of recent decades. 

Meanwhile, the sorting process continued in College. College weeds 
out many students, disproportionately the least able. The figure below 
shows the Situation as of the 1980s. 9 The line for students entering Col¬ 
lege reproduces the one shown in the preceding figure. The line for stu¬ 
dents completing the B. A. shows an even more efficient sorting process. 
A high proportion of people with poor test scores—more than 20 per¬ 
cent ofthose in the second decile (between the lOth and 20th centile), 
for example—entered a two- or four-year College. But fewer than 2 per¬ 
cent of them actually completed a bachelor's degree. Meanwhile, about 
70 percent of the students in the top decile of ability were completing 
aB.A. 
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Cognitive sorting continues from the time that students 
enter College to the time they get a degree 
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So a variety of forces have combined to ensure that a high propor¬ 
tion of the nation’s most able youths got into the category of College 
graduates. But the process of defining a cognitive elite through educa¬ 
tion is not complete. The socially most significant part of the parti- 
tioning remains to be described. In the 1950s, American higher 
education underwent a revolution in the way that sorted the College 
population itself, 

THE CREATION OE A COGNITIVE ELITE WITHIN THE 
COLLEGE SYSTEM 

The experience of Harvard with which we began this discussion is a 
parable for the experience of the nation’s university System. Insofar as 
many more people now go to College, the College degree has become 
more democratic during the twentieth Century. But as it became demo- 
cratic, a new elite was developing even more rapidly within the System. 
From the early 1950s into the mid-1960s, the nation’s university System 
not only became more efficient in bringing the bright youngsters to col- 
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lege, it became radically more efficient at sorting the brightest of the 
bright into a handful of elite Colleges. 

The Case of Ivy League and the State of Pennsylvania: The 1920s 
Versus the 1 960s 

Prior to World War II, America had a Stratum of elite Colleges just as it 
has now, with the Ivy League being the best known. Then as now, these 
schools attracted the most celebrated faculty, had the best libraries, and 
sent their graduates on to the best graduate schools and to prestigious 
jobs. Of these elite schools, Harvard was among the most famous and 
the most selective. But what was true of Harvard then was true of the 
other elite schools. They all had a thin layer of the very brightest among 
their students but also many students who were merely bright and a fair 
numher of students who were mediocre. They tapped only a fragment 
of the cognitive talent in the country. The valedictorian in Kalamazoo 
and the Kansas farm girl with an IQ of 140 might not even be going to 
College at all. If they did, they probably went to the nearest state uni- 
versity or to a private College affiliated with their church. 

One of the rare Windows on this period is provided by two little- 
known sources of test score data. The first involves the earliest SATs, 
which were first administered in 1926. As part of that effort, a Stan¬ 
dard ized intelligence test was also completed by 1,080 of the SAT sub- 
jects. In its first annual report, a Commission appointed by the College 
Entrance Examination Board provided a table for converting the SAT 
of that era to IQ scores. 1101 Combining that Information with reports of 
the mean SAT scores for entrants to schools using the SAT, we are able 
to approximate the mean IQs of the entering students to the Ivy League 
and the Seven Sisters, the most prestigious schools in the country at 
that time. 1111 

Judging from this Information, the entering classes of these schools 
in 1926 had a mean IQ of about 117, which places the average Student 
at the most selective schools in the country at about the 88th percentile 
of all the nation’s youths and barely above the 115 level that has often 
been considered the basic demarcation point for prime College mate¬ 
rial. 

In the same year as these SAT data were collected, the Carnegie 
Foundation began an ambitious statewide study of high school seniors 
and their College experience in the entire state of Pennsylvania. 12 By 


happy coincidence, the investigators used the same form of the Otis In¬ 
telligence Test used by the SAT Commission. Among other tests, they 
reported means for the sophomore classes at all the Colleges and uni- 
versities in Pennsylvania in 1928. Pennsylvania was (then as now) a 
large state with a wide variety of public and private schools, small and 
large, prestigious and pedestrian. The IQ equivalent of the average of 
all Pennsylvania Colleges was 107, which put the average Pennsylvania 
Student at the 68th percentile, considerably below the average of the 
elite schools. But ten Pennsylvania Colleges had freshman classes with 
mean IQs that put them at the 75th to 90 percentiles. [l31 In other words, 
students going to any of several Pennsylvania Colleges were, on average, 
virtually indistinguishable in cognitive ability from the students in the 
Ivy League and the Seven Sisters. 

Now let us jump to 1964, the first year for which SAT data for a large 
number of Pennsylvania Colleges are available. We repeat the exercise, 
this time using the SAT-Verbal test as the basis for analysis. 1141 Two im¬ 
portant changes had occurred since 1928. The average freshman in a 
Pennsylvania College in 1964 was much smarter than the average Penn¬ 
sylvania freshman in 1928—at about the 89th percentile. At the same 
time, however, the elite Colleges, using the same fourteen schools rep- 
resented in the 1928 data, had moved much further out toward the edge, 
now boasting an average freshman who was at the 99th percentile of 
the nation’s youth. 

Cognitive Strati/icarion Tfiroughout the College System by the 1960s 

The same process occurred around the country, as the figure below 
shows. We picked out Colleges with freshman SAT-Verbal means that 
were separated by roughly fifty-point intervals as of 1961. 1151 The spe¬ 
cific schools named are representative of those clustering near each 
break point. At the bottom is a state College in the second echelon of a 
state System (represented by Georgia Southern); then comes a large state 
university (North Carolina State), then five successively more selective 
private schools: Villanova, Tulane, Colby, Amherst, and Harvard. We 
have placed the SAT scores against the backdrop of the overall distri- 
bution of SAT scores for the entire population of high school seniors 
(not just those who ordinarily take the SAT), using a special study that 
the College Board conducted in the fall of 1960. The figure points to 
the general phenomenon already noted for Harvard: By 1961, a large 
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Cognitive stratification in Colleges by 1961 
The SAT Distribution for AU High School Seniors 
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gap separated the Student bodies of the elite schools from those of the 
public universities. Within the elite schools, another and significant 
level of stratification had also developed. 

As the story about Harvard indicated, the period of this stratification 
seems to have been quite concentrated, beginning in the early 1950s. Ii6] 
It remains to explain why. What led the nation’s most able College age 
youth (and their parents) to begin deciding so abruptly that State U. 
was no longer good enough and that they should strike out for New 
Haven or Palo Alto instead? 

If the word democracy springs to your tongue, note that democracy— 
at least in the economic sense—had little to do with it. The Harvard 
freshman dass of 1960 comprised fewer children from low-income fam- 
dies, not more, than the freshman dass in 1952. 17 And no wonder. Han 
vard in 1950 had been cheap by today’s Standards. In 1950, total costs 
for a year at Harvard were only $8,800—in 1990 dollars, parents of to- 
day^ College students will be saddened to learn. By I960, total costs 
there had risen to $12,200 in 1990 dollars, ahefty 40 percent increase. 
According to the guidelines of the times, the average family could, if it 


stretched, afford to spend 20 percent of its income to send a child to 
Harvard. 1181 Seen in that light, the proportion of families who could af¬ 
ford Harvard decreased slightly during the 1950s. 1191 Scholarship help in- 
creased but not fast enough to keep pace. 

Nor had Harvard suddenly decided to maximize the test scores of its 
entering dass. In a small irony of history, the Harvard faculty had de¬ 
cided in 1960 not to admit students purely on the basis of academic po¬ 
tential as measured by tests but to consider a broader ränge of human 
qualities. 20 Dean Bender explained why, voicing his fears that Harvard 
would “become such an intdlecfeual hot-house that the unfortunate as- 
pects of a self-conscious ‘intellectualism’ would become dominant and 
the precious, the brittle and the neurotic take over.” He asked a very 
good question indeed: “ln other words, would being part of a super-elite 
in a high prestige Institution be good for the healthy development of 
the ablest 18- to 22-year-olds, or would it tend to be a warping and nar- 
rowing experience?” 21 ln any case, Harvard in 1960 continued, as it had 
in the past and would in the future, to give weight to such factors as the 
applicant’s legacy (was the father a Harvard alum?), his potential as a 
quarterback or stroke for the eight-man shell, and other nonacademic 
qualities. 22 

The baby boom had nothing to do with the change. The leading edge 
of the baby boomer tidal wave was just beginning to reach the campus 
by I960 12 ' 1 

So what had happened? With the advantage of thirty additional years 
of hindsight, two trends stand out more clearly than they did in 1960. 

First, the 1950vS were the years in which television came of age and 
long-distance travel became commonplace. Their effects on the atti- 
tudes toward College choices can only be estimated, but they were surely 
significant. For students coming East from the Midwest and West, the 
growth of air travel and the interstate highway System made travel to 
school faster for affluent families and cheaper for less affluent ones. 
Other effects may have reflected the decreased psychic distance of 
Boston from parents and prospective students living in Chicago or Salt 
Lake City, because of the ways in which the world had become elec- 
tronicaüy smaller. 

Second, the 1950s saw the early stages of an increased demand that 
results not from proportional changes in wealth but from an expanding 
number of affluent customers competing for scarce goods. Price in- 
creases for a wide variety of elite goods have outstripped changes in the 
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consumer price Index or changes in mean income in recent decades, 
sometimes by Orders of magnitude. The cost of Fifth Avenue apartments, 
seashore property, Van Gogh paintings, and rare stamps are all exam- 
ples. Prices have risen because demand has increased and supply can- 
not. In the case of education, new universities are built, but not new 
Princetons, Harvards, Yales, or Stanfords. And though the proportion 
of families with incomes sufficient to pay for a Harvard education did 
not increase significantly during the 1950s, the raw number did. Using 
the 20-percent-of-farmly-income rule, the number of families that could 
afford Harvard increased by 184,000 from 1950 to 1960. Using a 10 per- 
cent rule, the number increased by 55,000. Only a small portion of these 
new families had children applying to College, but the number of slots 
in the ffeshmen classes of the elite schools was also small. College en- 
rollment increased from 2.1 million students in 1952 to 2.6 million by 
1960, meaning a half-million more competitors for available places. It 
would not take much of an increase in the propensity to seek elite ed- 
ucations to produce a substantial increase in the annual applications to 
Harvard, Yale, and the othersJ 241 

We suspect also that the social and cultural forces unleashed by World 
War II played a central role, but probing them would take us far afield. 
Whatever the combination of reasons, the basics of the Situation were 
straightforward: By the early 1960s, the entire top echelon of American 
universities had been transformed. The screens filtering their students 
from the masses had not been lowered but changed. Instead of the old 
screen—woven of dass, religion, region, and old school ties—the new 
screen was cognitive ability, and its mesh was already exceeding fine. 

Changes Since the 1960s 

There have been no equivalent sea changes since the early 1960s, but 
the concentration of top students at elite schools has intensified. As of 
the early 1990s, Harvard did not get four applicants for each opening, 
but closer to seven, highly self-selected and better prepared than ever. 
Competition for entry into the other elite schools has stiffened compa- 
rably. 

Philip Cook and Robert Frank have drawn together a wide variety of 
data documenting the increasing concentration. 25 There are, for exam- 
ple, the Westinghouse Science Talent Search finalists. In the 1960s, 47 
percent went to the top seven Colleges (as ranked in the Barron’s list 


that Cook and Frank used). In the 1980s, that proportion had risen to 
59 percent, with 39 percent going to just three Colleges (Harvard, MIT, 
and Princeton). 1261 Cook and Frank also found that from 1979 to 1989, 
the percentage of students scoring over 700 on the SAT-Verbal who 
chose one of the “most competitive Colleges” increased from 32 to 43 

*. 1271 

percent. 

The degree of partitioning off of the top students as of the early 1990s 
has reached startling proportions. Consider the list of schools that were 
named as the nations top twenty-five large universities and the top 
twenty-five small Colleges in a well-known 1990 rankingJ 1 Together, 
these fifty schools accounted for just 59,000 out of approximately 1.2 
million students who entered four-year institutions in the fall of 1990 
fewer than one out of twenty of the natioiVs freshmen in four-year Col¬ 
leges. But they took in twelve out of twenty of the students who scored 
in the 700s on their SAT-Verbal test. They took in seven out of twenty 
of students who scored in the 600s. l ~ Jl 

The concentration is even more extreme than that. Suppose we take 
just the top ten schools, as ranked by the number of their freshmen who 
scored in the 700s on the SAT-Verbal. Now we are talking about schools 
that enrolled a total of only 18,000 freshmen, one out of every sixty- 
seven nationwide. Just these ten schools—Harvard, Yale, Stanford, 
University of Pennsylvania, Princeton, Brown, University of California 
at Berkeley, Cornell, Dartmouth, and Columbia—soaked up 31 percent 
of the nations students who scored in the 700s on the SAT-Verbal. Har¬ 
vard and Yale alone, enrolling just 2,900 freshmen—roughly 1 out of 
every 400 freshmen—accounted for 10 percent. In other words, scoring 
above 700 is forty times more concentrated in the freshman classes at 
Yale and Harvard than in the national SAT population at large—and 
the national SAT population is already a slice off the top of the distri- 
bution. 1 ' 01 

HOW HIGH ARE THE PART1TIONS? 

We have spoken of “cognitive partitioning” through education, which 
implies separate bins into which the population has been distributed. 
But there has always been substantial intellectual overlap across educa- 
tional levels, and that remains true today. We are trying to convey a Sit¬ 
uation that is as much an ongoing process as an outcome. But before 
doine so, the time has come for the first of a few essential bits of statis- 
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tics: the concepts of distribution and Standard deviation. If you are 
new to statisti.es, we recommend that you read the more detailed 
explanation in Appendix 1; you will enjoy the rest of the book more 
if you do. 

A Digression: Standard Deviations and Why They A re Important 

Very briefly, a distribution is the pattern formed by many individual 
scores. The famous “normal distribution” is a bell-shaped curve, with 
most people getting scores in the middle ränge and a few at each end, 
or “tail,” of the distribution* Most mental tests are designed to produce 
normal distributions. 

A Standard deviation is a common language for expressing scores. 
Why not just use the raw scores (SAT points, IQ points, etc.)? There 
are many reasons, but one of the simplest is that we need to compare re- 
sults on many different tests. Suppose you are told that a horse is six- 
teen hands tall and a snake is quarter of a rod long. Not many people 
can teil you from that Information how the height of the horse com- 
pares to the length of the snake. If instead people use inches for both, 
there is no problem. The same is true for statistics. The Standard devi¬ 
ation is akin to the inch, an all-purpose measure that can be used for 
any distribution. Suppose we teil you that Joe has an ACT score of 24 
and Tom has an SAT-Verbal of 720. As in the case of the snake and the 
horse, you need a lot of information about those two tests before you 
can teil much from those two numbers. But if we teil you instead that 
Joe has an ACT score that is .7 Standard deviation above the mean and 
Tom has an SAT-Verbal that is 2.7 Standard deviations above the mean, 
you know a Lot. 

How big is a Standard deviation? For a test distributed normally, a per- 
son whose score is one Standard deviation below the mean is at the löth 
percentile. A person whose score is a Standard deviation above the mean 
is at the 84th percentile. Two Standard deviations from the mean mark 
the 2d and 98th percentiles. Three Standard deviations from the mean 
marks the bottom and top thousandth ofa distribution. Or, in short, as 
a measure of distance from the mean, one Standard deviation means 
“big,” two Standard deviations means “very big,” and three Standard de- 
viations means “huge.” Standard deviation is often abbreviated “SD,” a 
convention we will often use in the rest of the book. 


Understanding How the Partitions Have Risen 

The figure below summarizes the Situation as of 1930, after three decades 
of expansion in College enrollment but before the surging changes of the 
decades to come. The area under each distribution is composed of peo- 
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ple age 23 and is proportional to its representation in the national pop- 
ulation of such people. The vertical lines denote the mean score for 
each distribution. Around them are drawn normal distribudons 
bell curves—expressed in terms of Standard deviations from the mean. 

It is easy to see from the figure above why cognitive stratification was 
only a minor part of the social landscape in 1930. At any given level of 
cognitive ability, the number of people without College degrees dwarfed 
the number who had them. College graduates and the noncollege pop' 
ulation did not differ much in IQ. And even the graduates of the top 
universities (an estimate based on the Ivy League data for 1928) had 
IQs well within the ordinary ränge of ability. 
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The comparable picture sixty years later, based on our analysis of the 
NLSY, is shown in the next figure, again depicted as normal distribu- 
tions. 1321 Note that the actual distributions may deviate from perfect non 
mality, especially out in the tails. 


Americans with and without a College degree as of 1990 


Three Populations of 23-Year-Olds in 1990 


Areas are proportional 
to the relative sizes of 
the populations. 
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The College population has grown a lot while its mean IQ has risen a 
bit. Most bright people were not going to College in 1930 (or earlier)— 
waiting on the bench, so to speak, until the game opened up to them. By 
1990, the noncoliege population, drained of many bright youngsters, 
had shifted downward in IQ. While the College population grew, the gap 
between College and noncollege populations therefore also grew. The 
largest change, however, has been the huge increase in the intelligence 
of the average Student in the top dozen universities, up a Standard devi- 
ation and a half from where the I vies and the Seven Sisters were in 1930. 
One may see other features in the figure evidently less supportive of cog- 
nitive partitioning. Our picture suggests that for every person within the 
tanks of College graduates, there is another among those without a Col¬ 
lege degree who has just as high an IQ—or at least almost. And as for the 
graduates of the dozen top schools, 1331 while it is true that their mean IQ 
is extremely high (designated by the +2.7 SDs to which the line points), 
they are such a small proportion of the nation’s population that they do 
not even register visually on this graph, and they too are apparently out- 
numbered by people with similar IQs who do not graduate from those 


Colleges, or do not graduate from College at all. Is there anything to be 
concerned about? hlow much partitioning has really occurred? 

Perhaps a few examples will illustrate. Think of your twelve dosest 
friends or colleagues. For most readers of this book, a large majority will 
be College graduates. Does it surprise you to learn that the odds of hav- 
ing even half of them be College graduates are only six in a thousand, if 
people were randomly paired off? [341 Many of you will not think it odd 
that half or more of the dozen have advanced degrees. But the odds 
against finding such a result among a randomly chosen group of twelve 
Americans are actually more than a million to one. Are any of the dozen 
a graduate of Harvard, Stanford, Yale, Princeton, Cal Tech, MIT, Duke, 
Dartmouth, Cornell, Columbia, University of Chicago, or Brown? The 
chance that even one is a graduate of those twelve schools is one in a 
thousand. The chance of finding two among that group is one in fifty 
thousand. The chance of finding four or more is less than one in a bil- 
lion. 

Most readers of this book—this may he said because we know a great 
deal about the Statistical tendencies of people who read a book like 
this—are in preposterously unlikely groups, and this reflects the degree 
of partitioning that has already occurred. 

In seine respects, the results of the exercise today are not so different 
from the results that would have been obtained in former years, Sixty 
years ago as now, the people who were most likely to read a book of this 
nature would be skewed toward those who had friends with College or 
Ivy League College educations and advanced degrees. The differences 
between 1930 and 1990 are these: 

First, only a small portion of the 1930 population was in a position 
to have the kind of circle of friends and colleagues that characterizes 
the readers of this book, We will not try to estimate the proportion, 
which would involve too many assumptions, but you may get an idea by 
examining the small area under the curve for College graduates in the 
1930 figure, and visualize some fractionof that area as representing peo¬ 
ple in 1930 who could conceivably have had the educational circle of 
friends and colleagues you have. They constituted the thinnest cream 
floating on the surface of American society in 1930. In 1990, they con¬ 
stituted a class. 

Second, the people who obtained such educations changed. Suppose 
that it is 1930 and you are one of the small number of people whose cir- 
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cle of twelve friends and colleagues included a sizable fraction of col- 
lege graduates. Suppose you are one of the even tinier number whose 
circle came primarily from the top universities. Your circle, selective and 
uncommon as it is, nonetheless will have been scattered across a wide 
ränge of intelligence, with IQs from 100 on up. Given the same educa- 
tional profile in one’s circle today, it would consist of a set of people with 
IQs where the bottom tenth is likely to be in the vicinity of 120, and 
the mean is likely to be in excess of 130—people whose cognitive abil- 
ity puts them out at the edge of the population at large. What might 
have been a circle with education or social class as its most salient fea- 
ture in 1930 has become a circle circumscribing a narrow ränge of high 
IQ scores today. 

The sword cuts both ways. Although they are not likely to be among 
our readers, the circles at the bottom of the educational scale comprise 
lower and narrower ranges of IQ today than they did in 1930. When 
many youngsters in the top 25 percent of the intelligence distribution 
who formerly would have stopped school in or immediately after high 
school go to College instead, the proportion of high-school-only persons 
whose intelligence is in the top 25 percent of the distribution has to fall 
correspondingly. The occupational effect of this change is that bright 
youngsters who formerly would have become carpenters or truck drivers 
or postal clerks go to College instead, thence to occupations higher on 
the socioeconomic ladder. Those left on the lower rungs are therefore 
likely to be lower and more homogeneous intellectually. Likewise their 
neighborhoods, whichget drainedof the bright and no longerpoor, have 
become more homogeneously populated by a less bright, and even 
poorer, residuum. In other chapters we focus on what is happening at 
the bottom of the distribution of intelligence. 

The point of the exercise in thinking about your dozen closest friends 
and colleagues is to encourage you to detach yourself momentarily from 
the way the world looks to you from day to day and contemplate how 
extraordinarily different your circle of friends and acquaintances is from 
what would be the norm in a perfectly fluid society. This profound iso- 
lation from other parts of the IQ distribution probably dulls our aware- 
ness of how unrepresentative our circle actually is. 

With these thoughts in mind, let us proceed to the technical answer 
to the question, How much partitioning is there in America? It is done 
by expressing the overlap of two distributions after they are equated for 
size. There are various ways to measure overlap. In the following table 


we use a measure called median overlap , which says what proportion of 
IQ scores in the lower-scoring group matched or exceeded the median 
score in the higher-scoring group. For the nationally representative 


Overlap Across the Educational Partitions 

Groups Being Compared Median Overlap 

High school graduates with College graduates 7% 

High school graduates with Ph.D.s, M.D.s, or LL.B.s 1% 

College graduates with Ph.D.s, M.D.s, and LL.Bs 21% 


NLSY sample, most of whom attended College in the late 1970s and 
through the 1980s, the median overlap is as follows: By this measure, 
there is only about 7 percent overlap between people with only a high 
school diploma and people with a B.A. or M.A. And even this small de- 
gree of overlap refers to all Colleges. If you went to any of the top hun- 
dred Colleges and universities in the country, the measure of overlap 
would be a few percentage points. If you went to an elite school, the 
overlap would approach zero. 

Even among College graduates, the partitions are high. Only 21 per¬ 
cent of those with just a B.A. or a B.S. had scores as high as the median 
for those with advanced graduate degrees. Once again, these degrees of 
overlap are for graduates of all Colleges. The overlap between the B.A. 
from a state teachers’ College and an MIT Ph.D. can be no more than a 
few percentage points. 

What difference does it make? The answer to that question will un- 
fold over the course of the book. Many of the answers involve the ways 
that the social fabric in the middle class and working class is altered 
when the most talented children of those families are so efficiently ex- 
tracted to live in other worlds. But for the time being, we can begin by 
thinking about that thin layer of students of the highest cognitive abil- 
ity who are being funneled through rarefied College environments, 
whence they go forth to acquire eventually not just the good life but of- 
ten an influence on the life of the nation. They are coming of age in en¬ 
vironments that are utterly atypical of the nation as a whole. The 
national percentage of 18-year-olds with the ability to get a score of 700 
or above on the SAT-Verbal test is in the vicinity of one in three hun¬ 
dred. Think about the consequences when about half of these students 
are going to universities in which 17 percent of their classmates also had 
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SAT-Vs in the 700s and another 48 percent had scores in the 600s. [351 
It is difficult to exaggerate how different the elite College population is 
from the population at large—first in its level of intellectual talent, and 
correlatively in its Outlook on society, politics, ethics, religion, and all 
the other domains in which intellectuals, especially intellectuals con- 
centrated into communities, tend to develop their own conventional 
wisdoms. 

The news about education is heartening and frightening, more or less 
in equal measure. Heartening, because the nation is providing a College 
education for a high proportion of those who could profit from it. 
Among those who graduate from high school, just about all the bright 
youngsters now get a crack at a College education. Heartening also be- 
cause our most elite Colleges have opened their doors wide for young¬ 
sters of outstanding promise. But frightening too. When people live in 
encapsulated worlds, it becomes difficult for them, even with the best 
of intentions, to grasp the realities of worlds with which they have lit- 
tle experience but over which they also have great influence, both pub¬ 
lic and private. Many of those promising undergraduates are never going 
to live in a community where they will be disabused of their misper- 
ceptions, for after education comes another sorting mechanism, occu- 
pations, and many of the holes that are still left in the cognitive 
partitions begin to get sealed. We now turn to that story. 


Chapter 2 

Cognitive Partitioning by 
Occupation 

People in different Jobs have different average IQs. Lawyers, for example, 
have higher IQs on the average than bus drivers. Whether they must have 
higher IQs than bus drivers is a topic we take up in detail in the next chapter. 
Here we Start by noting simply that people from different ranges on die IQ 
scale end up in different Jobs. 

Whatever the reasonfor the link bemeen IQ and occupation, itgoes deep. 
lf you want to guess an adult male's job Status, the results ofhis childhood IQ 
test help you as much as knowing how many years he went to school 

IQ becomes more important as the job ge ts int ellectually tougher. To be 
able to dig a ditch, you need a strong back but not necessarily a high IQ score. 
To be a master carpenter, you need some higher degree of intelligence along 
with skill with your hands. To be afirst^rate lawyer, you had better come from 
the upper end of the cognitive ability distribution. The same may be said ofa 
handful of other occupations, such as accountants, engineers and architects, 
College teachers, dentists and physicians, mathematicians , and scientists. The 
mean IQ of people entering those fields is in the neighborhoodof 120. ln 1900, 
only one out of twenty people in the top 10 percent in intelligence were in any 
of these occupations, a figure that did not change much through 1940. But 
after 1 940, more and more people with high IQs flowed into those Jobs, and 
by 1990 the same handful of occupations employed about 25 percent of all 
the people in the top tenth of intelligence. 

Düring the same period, IQ became more important for business execu- 
tives. In 1900, the CEO ofa large Company was likely to be a WASP bom 
into affluence . He may have been bright, but that was not mainly how he was 
chosen, Much was still the same as late as 1 950. The next three decades saw 
a great social leveling, as the executive suites filled with bright people who could 
maximize corporate Profits, and never mind if they came from the wrongside 
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ofthe tracks or worshipped at a temple instead ofa church. Meamuhile, the 
College degree became a requirement for many business positions, and gradu- 
ate education went from a rarity to a commonplace among senior executives. 

When one combines the people known to be in high-IQ professions ivith 
estimates of the numbers of business executives who are draum from the top 
tenth in cognitive ability, the results do not leave much room for maneuver. 
The specific proportions are open to argument, but the main point seems be - 
yond dispute: Even as recently as midcentur y, America u/as still a society in 
which most bright people were scattered throughout the wide ränge ofjobs. As 
the Century draws to a close, a very high Proportion ofthat same group is notv 
concentrated within a few occupations that are highly screened for IQ, 


J obs sort people by their IQs, just as College does. But there is a differ- 
ence between educational and occupational sorting. People spend 
only one to two decades in school. School may seem like forever when 
we are there, but we spend most of our lives with the sorting that Cen¬ 
ters on work and carries over into circles of friends and colleagues, and 
into communities—if not physically the same workplaces, communi- 
ties, and friends throughout the life span, then generically similar ones. 
In this chapter, we continue our discussion of the contours of the intel- 
lectual landscape. An examination of occupational sorting will carry us 
through to the end of Part I. 

JOBS AND INTELLIGENCE 

No one decreed that occupations should sort us out by our cognitive 
abilities, and no one enforces the process. It goes on beneath the sur- 
face, guided by its own invisible hand. Testers observe that job Status 
and intelligence test scores have gone together since there were in- 
telligence tests to give. 1 As tests evolved and as the measurement of 
Status was formalized, studying the relation between the jobs and in¬ 
telligence became a cottage industry for social scientists. By now, the 
relation has been confirmed many times, in many countries, and in 
many approaches to the data. 2 

This is not to say that the experts find nothing to quarrel about. The 
technical literature is replete with disagreement. Aside from the purely 
technical bones of contention, the experts argue about whether the IQ- 
job Status connection is a by-product of a more fundamental link be- 


tween educational level and job Status. For example, it takes a law de¬ 
gree to be a lawyer, and it takes intelligence to get into and through law 
school, but aside from that, is there any good reason why lawyers need 
to have higher IQs on average than, say, bus drivers? At the height of 
egalitarianism in the 1970s, the received wisdom in many academic cir¬ 
cles was “no,” with Christopher Jencks’s Inequality the accepted text. 3 
A related argument, stated forcefully by James Fallows, arises over 
whether an IQ score is a credential for certain jobs, like a union card for 
a musician, or whether there is a necessary link between job Status and 
intelligence, like a good ear. 4 By the time we get to the end of Part I, 
our answers to such questions should be clear. Flere we review a few of 
the more illuminating findings, to push the discussion beyond the fact 
that occupational Status is correlated with IQ. 

One notable finding is that the correlation between IQ and job Sta¬ 
tus is just about as high if the IQ test is given in childhood, decades be- 
fore people enter the job market, as it is among young adults who are 
taking an intelligence test after years of education. For example, in a 
small but elegant longitudinal study of childhood intelligence and adult 
outcomes, the boys and girls in the sample were given IQ tests in child¬ 
hood and then their job statuses and levels of schooling were measured 
on Standard scales after they were at least 26 years old. 5 The IQ scores 
they got when they were 7 or 8 years old were about as correlated with 
the Status level of their adult jobs as their adult IQs would have been. 161 
Inasmuch as childhood IQ is more correlated with Status than com- 
pleted education, as it is in some studies, the thesis that IQ scores really 
just measure educational level is weakened. 

Family members typically resemble each other in their occupational 
Status. 171 We are talking here not about a son or a niece or a brother-in- 
law going into the family business but about job Status, however mea¬ 
sured. On rating scales that categorize jobs from those with the highest 
Status to those with the lowest, family members tend to land at similar 
levels. There are many exceptions; we all hear occasionally about fam- 
ilies with several members who are doctors and lawyers plus another 
who is a blue-collar worker, or vice versa. But such stories call attention 
to themselves because they describe rarities. Mostly, relatives occupy 
neighboring, if not the same, rungs on the job Status ladder, and the 
closer the relationship is, the nearer they are. Such commonplace find¬ 
ings have many possible explanations, but an obvious one that is not 
mentioned or tested often by social scientists is that since intelligence 
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runs in families and intelligence predicts Status, Status must run in farrv 
ilies. In fact, this explanation somehow manages to be both obvious and 
controversial. 8 

One useful study of family resemblance in Status comes from Den- 
mark and is based on several hundred men and women adopted in or 
around Copenhagen between 1924 and 1947. 9 Four out of five of these 
adopted people had been placed with their adopting families in their 
first year of life; the average age of placement Overall was 3 months. To 
all intents and purposes, then, the adoptees shared little common en- 
vironment with their biological siblings, but they shared a home envi- 
ronment with their adoptive siblings. In adulthood, they were compared 
with both their biological siblings and their adoptive siblings, the idea 
being to see whether common genes or common home life determined 
where they landed on the occupational ladder. The biologically related 
siblings resembled each other in job Status, even though they grew up 
in different homes. And among them, the full siblings had more simi- 
lar job Status than the half siblings. Meanwhile, adoptive siblings were 
not significantly correlated with each other in job statusJ 10 ' 

THE GROWTH OF HIGHTQ PROFESSIONS 

The above comments apply to all sorts of occupations, from low Status 
to high. But the relationship of IQ to occupations changes as the job 
becomes more cognitively demanding. Almost anyone can become a 
ditch digger (if he has a strong enough back); many can become cabi- 
netmakers (if they have good enough smalhmotor skills), but only peo- 
ple from a fairly narrow ränge of cognitive ability can become lawyers. 
If lawyering pays more than cabinetmaking, what happens as the num- 
ber of lawyering jobs increases, as it has in America? More people with 
high IQs are diverted to lawyering, which means that they are not go- 
ing to become cabinetmakers or ditcH diggers. 

Now imagine that process writ large, and consider what has happened 
within the handful of occupations that are most highly screened for IQ 
We will concentrate here on a dozen such occupations, which we will 
refer to as “high-IQ professions.” Some of them have existed as long as 
IQ tests and are included in the list of occupations for the 1900 census: 
accountants, architects, chemists, College teachers, dentists, engineers, 
awyers, and physicians. Others have emerged more recently or are re- 


labeled in more recent occupational breakdowns: Computer scientists, 
mathematicians, natural scientists, and social scientists. 

The mean IQ of people entering those fields is about 120, give or take 
a few points. 11 The state of knowledge is not perfect, and the sorting 
process is not precise. Different studies find slightly different means for 
these occupations, with some suggesting that physicians have a mean 
closer to 125, for example. 12 Theoretical physicists probably average 
higher than natural scientists in general, Within each professlon, the 
ränge of scores may be large. Even an occupation with a high mean may 
include individuals with modest scores; it will certainly include a sizable 
Proportion below its mean—50 percent of them, if the distribution is 
symmetrical above and below its mean. [ni 

Nonetheless, 120 is a good ballpark figure for estimating the mean 
person in these highTQ professions, and it also has the advantage of 
marking the cutoff point for approximately the top tenth of the entire 
population in IQ. 14 Armed with this Information plus a few conjectures, 
we may explore how cognitive stratification at the top of the American 
labor market has changed over the years. The figure below shows the 
answer for the twentieth Century to date. 

Once again, the portrait of American society depends on vantage 
point. Let us begin with the bottom line, showing the percentage of the 
entire labor force that is engaged in high-IQ professions. There has been 
a proportional increase during the twentieth Century, but these people 
still constituted only about one out of fifteen Americans in the labor 
force as of 1990. 

Now consider Americans in the top 10 percent (the top decile, in 
other words) in cognitive ability—everyone over the age of 25, includ- 
ing housewives, the retired, and others who are not counted as being 
part of the labor force. These people are represented by the middle line 
in the graph. In 1900, the number of jobs in the high-IQ professions 
soaked up only about one out of twenty of these talented people. By 
1990, they soaked up almost five times as many, or one out of four, 

Finally, consider the top line in the graph, which is limited to Amer¬ 
icans who are in both the top decile of IQ and the labor force. In 1900, 
about one out of eleven was in one of the highTQ professions; by 1990, 
more than one out of three. This still leaves almost two out of three of 
them unaccounted for, but we will get to them in the next section of 
the chapter. 
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The top IQ decile becomes rapidly more concentrated 
^ high-IQ professions from 1940 onward 


People in the high-IQ occupations, expressed as a percentage of 
40%- 
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The High-IQ Professions and the Cognitive Elite 

We have been discussing the top decile: everyone with an IQ of 120 or 
higher. What about people in the even more rarefied cognitive elite, the 
top fraction of a centile who are so concentrated in a handful of universi- 
ties during their College years? We have little to teil us exaedy what is hap¬ 
pen ing now. but we know what the Situation was fifty years ago, through 
Lewis Terman’s famous study of 1,500 highly gifted children who were bom 
in the early 1900s and followed throughout their lives. Their average IQs 
were over three Standard deviations above the mean, meaning that the Ter- 
man sample represented about l/300th of the population. As of 1940, the 
members of the Terman sample who had finished their schooling were en- 
gaged in high-IQ professions at three times the rate of people in the top 
10 percent—24 percent for the Terman sample against 8 percent for the 
top decile in 1940, as the preceding figure shows. 16 If that was the case in 
1940, when fewer than one in twelve people in the top decile were work¬ 
ing in high-IQ professions, what might be the proportion for a compara- 
ble sample today? Presumably much higher, though how much higher is 
impossible to estimate with the available data. 1171 


COGNITIVE SCREENS IN THE EXECUTIVE SUITE 

The changes in our twelve high-IQ professions understate how much 
occupational cognitive Segregation there has been in this Century. We 
lack data about other professions and occupations in which mean IQ 
may be comparably high (e.g., military officers, writers, joumalists). But 
the biggest omission involves business executives. For while the mean 
IQ of all people who go into business cannot be near 120, 1,01 both com¬ 
mon sense and circumstantial evidence suggest that people who rise to 
the upper echelons of large businesses tend to have high IQs and that 
this tendency has increased during the course of the Century. 

One source of circumstantial evidence that ties success in major busi¬ 
ness to intelligence is the past and present level of education of busi¬ 
ness executives. [l9J In 1900, more than two-thirds of the presidents and 
chairmen of America’s largest corporations did not have even a College 
degree—not because many of them were poor (few had risen from out- 




58 The Emergence of a Cognitive Elite 


C ogrdtive Partitioning by Occupation 59 


right poverty) but because a College degree was not considered impor¬ 
tant for running a business. 20 A Wall Street tycoon (himself a Harvard 
alumnus) writing in 1908 advised parents that “practical business is the 
best school and College” for their sons who sought a business career and 
that, indeed, a College education “is in many instances not only a hin- 
drance, but absolutely fatal to success.” 21 

The lack of a College education does not mean that senior executives 
of 1900 were necessarily less bright than their counterparts in 1990, But 
other evidence points to a revolution in the recruitment of senior ex¬ 
ecutives that was not much different from the revolution in educational 
stratification that began in the 1950s. In 1900, the CEO of a large Com¬ 
pany was likely to be the archetype of the privileged capitalist elite that 
C. Wright Mills described in The Power Elite: born into affluence, the 
son of a business executive or a Professional person, not only a WASP 
but an Episcopalian WASP. 22 In 1950, it was much the same. The fa- 
thers’ occupations were about the same as they had been in 1900, with 
over 70 percent having been business executives or Professionals, and, 
while Protestantism was less overwhelmingly dominant than it had been 
in 1900, it remained the right religion, with Episcopalianism still being 
the rightest of all, Fewer CEOs in 1950 had been born into wealthy fam- 
ilies (down from almost half in 1900 to about a third), but they were 
continuing to be drawn primarily from the economically comfortable 
part of the population. The proportion coming from poor families had 
not changed. Many CEOs in the first half of the Century had their jobs 
because their family’s name was on the sign above the factory door; many 
had reached their eminent positions only because they did not have to 
compete against more able people who were excluded from the compe- 
tition for lack of the right religion, skin color, national origin, or fam- 
ily Connections. 

In the next twenty-five years, the picture changed. The proportion 
of CEOs who came from wealthy families had dropped from almost half 
in 1900 and a third in 1950 to 5.5 percent by 1976. 23 The CEO of 1976 
was still disproportionately likely to be Episcopalian but much less so 
than in 1900—and by 1976 he was also disproportionately likely to be 
Jewish, unheard of in 1920 or earlier. In short, social and economic back- 
ground was no longer nearly as important in 1976 as in the first half of 
the Century. Educational level was becoming the high road to the ex¬ 
ecutive suite at the same time that education was becoming more de¬ 


in fifty years, the education of the typical CEO 
increases from high school to graduate school 


Percentage of CEOs with... 
70% - ...no more than a 



pendent on cognitive ability, as Chapter 1 showed. The figure above 
traces the change in highest educational attainment from 1900 to 1976 
for CEOs of the largest U.S. Companies. 

The timing of the changes is instructive. The decline of the high 
school-educated chief executive was fairly steady throughout the period. 
College-educated CEOs surged into the executive suite in the 
1925-1950 period. But as in the case of educational stratification, the 
most dramatic shift occurred after 1950, represented by the skyrocket- 
ing proportion of chief executives who had attended graduate school. [24] 
By 1976, 40 percent of the Fortune 500 Companies were headed by in- 
dividuals whose background was in finance or law, fields of study that 
are highly screened for intelligence. So we are left with this conserva- 
rive internretation: Nobodv knows what the IO mean or distribution 


was for executives at the tum of the Century, but it is clear that, as of 
the 1990s, the cognitive screens were up. How far up? The broad enve- 
lope of possibilities suggests that senior business executives soak up a 
large proportion of the top IQ decile who are not engaged in the dozen 
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high-IQ professions. The constraints leave no other possibility. Here are 
the constraints and the arithmetic: 

In 1990, the resident population ages 25 to 64 (the age group in which 
the vast majority of people working in high-IQ professions fall) con- 
sisted of 127 million people. 25 By definition, the top IQ decile thus con- 
sisted of 12.7 million people* The labor force of persons aged 25 to 64 
consisted of 100 million people. The smartest working-age people are 
disproportionately likely to be in the labor force (especially since career 
opportunities have opened up for women). As a working assumption, 
suppose that the labor force of 100 million included 11 million of the 
12.7 million people in the top IQ decile. 

We already know that 73 million people worked in the high-IQ pro¬ 
fessions that year and have reason to believe that about half of those 
(3.65 million) have lQs of 120 or more. Subtracting 3.65 million from 
11 million leaves us with about 7.4 million people in the labor force 
with IQs of 120 or more unaccounted for. Meanwhile, 12.9 million peo¬ 
ple were classified in 1980 as working in executive, administrative, and 
managerial positions. [26) A high proportion of people in those positions 
graduated from College, one screen. They have risen in the corporate 
hierarchy over the course of their careers, which is probably another 
screen for IQ. What is their mean IQ? There is no precise answer. Stud- 
ies suggest that the mean for the job category including all white-collar 
and Professionals is around 107, but that category is far broader than the 
one we have in mind. Moreover, the mean IQ of four-year College grad- 
uates in general was estimated at about 115 in 1972, and senior execu- 
tives probably have a mean above that average. 27 

At this point, we are left with startlingly little room for maneuver. 
How many of those 12.9 million people in executive, administrative, 
and managerial positions have IQs above 120? Any plausible assump¬ 
tion digs deep into the 7.4 million people with IQs of 120 or more who 
are not already engaged in one of the other high-IQ professions and 
leaves us with an extremely high proportion of people of the labor force 
with IQs above 120 who are already working in a high-IQ profession or 
in an executive or managerial position. One could easily make a case 
that the figure is in the neighborhood of 70 to 80 percent. 

Cognitive sorting has become highly efficient in the last half Century, 
but has it really become that efficient? We cannot answer definitely yes, 
but it is difficult to work back through the logic and come up with good 
reasons for thinking that the estimates are far off the mark. 


It is not profitable to push much further along this line because the 
uncertainties become too great, but the main point is solidly estabhshed 
in any case: ln midcentury, America was still a society in which a large 
proportion of the top tenth of IQ, probably a majority, were scattered 
throughout the population, not working in a high-IQ professLon and not 
in a managerial position. As the Century draws to a close, some very htgh 
proportion of that same group is concentrated within those highly 
screened jobs. 



Chapter 3 


The Economic Pressure 
to Partition 


What accounts for the way that people with different levels of IQ end up in 
different occupations? The fashionable explanation has been education. Peo¬ 
ple with high SAT scores get into the best Colleges; people with the high GRE, 
MCAT, or LSAT test scores get into Professional and graduate schools; and 
the education defines the occupation. The SAT score becomes unimportant 
once the youngster has gotten into the right College or graduate school. 

Without doubt, education is pan of the explanation; physicians need a high 
IQ to get into medical school, but they also need to leam the material that 
medical school teaches before they can be physicians . Plenty of hollow cre- 
dentialing goes on as well, ifnot in medicine then in other occupations , as the 
educational degree becomes a ticket for jobs that could be done just as well by 
people without the degree. 

But therelationship ofcognitive ability tojobperformancegoes beyorid that. 
A smarter employee is, on the average, a more proficient employee . This holds 
true within professions: Lawyers with higher IQs are, on the average, more 
productive than lawyers with lower IQs. 1t holds true for skilled blue-collar 
jobs: Carpenters wth high IQs are also (on average) more productive than 
carpenters with lower IQs. The relationship holds , although wealdy , even 
among people in unskilled manual jobs. 

The magnitude of the relationship between cognitive ability and job per- 
fomvmce is greater than once thought. A flood of new analyses during the 
1 980s established several points with large economic and policy implications: 

Test scores predictjob performance because they measure g, Spearmans 
general intelligence factor , not because they identify “aptitude” for a specific 
job. Any broad test of general intelligence predicts proficiency in most com¬ 
mon occupations, and does so more accurately than tests that are narrowly 
constructed around thejob’s specific tasks. 
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The advantage conferred by IQ is long-lasting. Much remains to be 
leamed, but usually the smarter employee tends to remain more productive 
than the less smart employee even after years on the job. 

An IQ score is a better predictor of job productivity than a job interview, 
reference checks, or College transcript. 

Most sweepmgly important, an employer that is free to pick among appli- 
cants can realize large economic gains from hiring those with the highest IQs. 
An economy that lets employers pick applicants with the highest IQs is a sig- 
nißcantly more efficient economy. Herein lies the policy Problem: Since 1971, 
Congress and the Supreme Court have effectively forbidden American em¬ 
ployers from hiring based on Intelligence tests. How much does this policy cost 
the economy? Calculating the answer is complex, so estimates vary widely, 
jrom what one authority thinks was a lower-bound estimate of$80 billion in 

1980 to what another authority called an upper-bound estimate of$13 billion 
for that year. 

Our main point has nothing to do with deciding how large the lass is or how 
large the gain would be if intelligence tests could be freely used for hiring. 
Rather, it is simply that intelligence itself is importantly related to job Perfor¬ 
mance. Laws can make the economy less efficient by forbidding employers to 
use intelligence tests, but laws cannot make intelligence unimportant. 


T this E X>int in the Mussten, the forces that sort people into jobs 
J- accordmg to their cognitive ability remain ambiguous. There are 

three mam possibilities, hinted at in the previous chapter but not as- 
sessed. 

The first is the Standard one: IQ really reflects education. Education 
imparts skills and knowledge—reading, writing, doing arithmetic, 
knowing some facts. The skills and knowledge are valuable in the work- 
place, so employers prefer to hire educated people. Perhaps IQ, in and 
ot itself, has something to do with people’s performance at work, but 
probably not much. Education itself is the key. More is better, for just 
about everybody, to just about any level. 

The second possibility is that IQ is cotrelated with job Status because 
we hve in a worid of attificial credentials. The artisan guilds of old wete 
replaced somewhere along the way by College or graduate degrees. Most 
Patents want to see theit children get at least as much education as 


The Economic Pressure to Partition 65 


they got, in part because they want their children to profit from 
the valuable credentials. As the society becomes richer, more child- 
ren get more education. As it happens, education screens for IQ, 
but that is largely incidental to job performance. The job market, in 
tum, screens for educational credentials. So cognitive stratification 
occurs in the workplace, but it reflects the premium put on educa¬ 
tion, not on anything inherent in either education or cognitive ability 
itself. 

The third possibility is that cognitive ability itself—sheer intellec- 
tual horsepower, independent of education—has market vaiue. Seen 
from this perspective, the College degree is not a credential but an in- 
direct measure of intelligence. People with College degrees tend to be 
smarter than people without them and, by extension, more valuable in 
the marketplace. Employers recruit at Stanford orYale not because grad- 
uates of those schools know more than graduates of less prestigious 
schools but for the same generic reason that Willie Sutton gave for rob- 
bing banks. Places like Stanford and Yale are where you find the coin 
of cognitive talent. 

The first two explanations have some validity for some occupations. 
Even the brightest child needs formal education, and some jobs require 
many years of advanced training. The prohlem of credentialing is wide- 
spread and real: the B.A. is a bogus requirement for many management 
jobs, the requirement for teaching certificates often impedes hiring good 
teachers in elementary and secondary schools, and the Ph.D. is irrele¬ 
vant to the work that many Ph.D.s really do. 

But whatever the mix of truth and fiction in the first two explana¬ 
tions, the third explanation is almost always relevant and almost always 
ignored. The process described in the previous chapter is driven by a 
characteristic of cognitive ability that is at once little recognized and 
essential for understanding how society is evolving: intelligence is fun- 
damentally related to productivity. This relationship holds not only for 
highly skilled professions but for jobs across the Spectrum. The power of 
the relationship is sufficient to give every business some incentive to 
use IQ as an important selection entehon. 

That in brief is the thesis of the chapter. We begin by reviewing the 
received wisdom about the links between IQ and success in life, then 
the evidence specifically linking cognitive ability to job productivity. 
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THE RECE1VED WISDOM 

“Test scores have a modest correlation with first-year grades and no cor- 
relation at all with what you do in the rest of your life,” wrote Derek 
Bok, then President of Harvard University, in 1985, referring to the 
SATs that all Harvard applicants takeJ 1 * Bok was poetically correct in 
ways that a College President understandably wants to emphasize. A 17- 
year-old who has gotten back a disappointing SAT score should not 
think that the future is bleak. Perhaps a freshman with an SAT inath 
score of 500 had better not have his heart set on being a mathemati- 
cian, but if instead he wants to run his own business, become a U .S. Sen¬ 
ator, or make a million dollars, he should not put aside those dreams 
because some of his friends have higher scores. The link between test 
scores and those achievements is dwarfed by the totality of other char- 
acteristics that he brings to his life, and that’s the fact that individuals 
should remember when they look at their test scores. Bok was correct 
in that, for practical purposes, the futures of most of the 18-year-olds 
that he was addressing are open to most of the possibilities that attract 
them. 

President Bok was also technically correct about the students at his 
own university. If one were to assemble the SATs of the incoming fresh- 
men at Harvard and twenty years later macch those scores against some 
quantitative measure of Professional success, the impact could be mod¬ 
est, for reasons we shall discuss. Indeed, if the measure of success was 
the most obvious one, cash income, then the relationship between IQ 
and success among Harvard graduates could be less than modest; it could 
be nil or even negative. 

Finally, President Bok could assert that test scores were meaningless 
as predictors of what you do in the rest of your life without fear of con- 
tradiction, because he was expressing what “everyone knows” about test 
scores and success. The received wisdom, promulgated not only in fea- 
ture stories in the press but codified in landmark Supreme Court deci- 
sions, has held that, first of all, the relation between IQ scores and job 
performance is weak, and, second, whatever weak relationship there is 
depends not on general intellectual capacity but on the particular men¬ 
tal capacities or skills required by a particular job. 131 

There have been several reasons for the broad acceptance of the con- 
clusions President Bok drew. Briefly: 


A Primer on the Correlation Coefficient 


We have periodically mentioned the “correlation coefficient’ without say- 
ing much except that it varies frotn -1 to +1. It is time for a bit more de¬ 
tail, with even more to be found in Appendix 1. As in the case of Standard 
deviations, we urge readers who shy from statistics to take the few minutes 
required to understand the concept. The nature of “correlation will be in- 

creasingly important as we go along. 

A correlation coefficient represents the degree to which one phenom- 
enon is linked to another. Height and weight, for example, have a positive 
correlation (the taller, the heavier, usually). A positive corte ation is one 
that falls between zero and +1, with +1 being an absolutely reliable, linear 
relationship. A negative correlation falls between 0 and -1, with 1 also 
representlng an absolutely reliable, linear relationship, but in the inverse 
direction. A correlation of 0 means no linear relationship whatsoevei. 

A crucial point to keep in mind about correlation coefficients now and 
throughout the rest of the book, is that correlations in the social Sciences 
are seldom much higher than .5 (or lower than -.5) and often much 
weaker—because social events are imprecisely measured and are usually af- 
fected by variables besides the ones that happened to be included in any 
particular body of data. A correlation of .2 can nevertheless be big for 
many social science topics. In terms of social phenomena, modest correla¬ 
tions can produce large aggregate effects. Witness the prosperity of casinos 
despite the statistically modest edge they hold over their customers. 


Moderate correlations mean mairy exceptions. We all know people who 
do not seem all that smart but who handle their jobs much more effec- 
tively than colleagues who probably have more raw intelligence. The 
correlations between IQ and various job-related measures are generally 
in the .2 to .6 ränge. Throughout the rest of the book, keep the follow- 
ing figure in mind, for it is what a highly significant correlation in the 
social Sciences looks like. The figure uses actual data from a randomly 
selected 1 percent of a nationally representative sample, using two vari¬ 
ables that are universally acknowledged to have a large and socially im¬ 
portant relationship, income and education, with the hne showing the 
expected change in income for each increment in years of education. 
For this sample, the correlation was a statistically significant .33, and 
the expected value of an additional year of education was an additional 
$2 800 infamily income-a major substantive increase. Yetlookat how 
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The Variation among individuals that lies 
behind a significant correlation coefficient 


Annual income (thousands of 1990 dollars) 
80- 


70- 


60- 

50- 



Correlation; +.33 

Expected increase in 
income per year of 
education: $2,800 


numerous are the exceptions; note especially how people with twelfth- 
grade educations are spread out all along the income continuum. For 
virtually every topic we will be discussing throughout the rest of the book } a 
plot of the raw data would reveal as many or more exceptions to the general 
Statistical relationship , and this must always be remembered in trying to trans - 
late the general rule to individuals. 

The exceptions associated with modest correlations mean that a wide 
ränge of IQ scores can be observed in almost any job, including com- 
plex jobs such as engineer or physician, a fact that provides President 
Bok and other critics of the importance of IQ with an abundant supply 
of exceptions to any general relationship. The exceptions do not inval¬ 
idste the importance of a statistically significant correlation. 

Restriction of ränge. In any particular job setting, there is a restricted 
ränge of cognitive ability, and the relationship between IQ scores and 
job performance is probably very weak in that setting. Forget about IQ 
for a moment and think about weight as a qualification for being an of¬ 
fensive tackle in the National Football League. The All-Pro probably is 
not the heaviest player. On the other hand, the lightest tackle in the 
league weighs about 250 pounds. That is what we mean by restriction 
of ränge. In terms of correlation coefficients, if we were to rate the per¬ 


formance of every NFL offensive tackle and then correlate those ratings 
with their weights, the result would probably be a correlation near zero. 
Should we then approach the head coaches of the NFL and recommend 
that they try out a superbly talented 150-pound athlete at offensive 
tackle? The answer is no. We would be right in concluding that perfor¬ 
mance does not correlate much with weight among NFL tackles, whose 
weights ränge upward from around 250, but not about the correlation 
in the general population. Imagine a sample of ordinary people drawn 
from the general population and inserted into an offensive line. The 
correlation between the performance of these people as tackles in foot- 
ball games and their weights would be large indeed, The difference be¬ 
tween these two correlations—one for the actual tackles in the NFL and 
the other a hypothetical one for people at large—illustrates the impact 
of restriction of ränge on correlation coefficients. [6] 

Con/usion between a credential and a correlation. Would it be silly to 
require someone to have a minimum score on an IQ test to get a license 
as a barber? Yes. Is it nonetheless possible that IQ scores are correlated 
with barbering skills? Yes. Later in the chapter, we discuss the economic 
pros and cons of using a weakly correlated score as a credential for hir- 
ing, but here we note simply that some people confuse a well-founded 
Opposition to credentialing with a less well-founded denial that IQ cor- 
relates with job performance. 7 

The weaknesses of individual studies, Until the last decade, even the 
experts had reason to think that the relationship must be neghgible. 
Scattered across journals, books, technical reports, Conference pro- 
ceedings, and the records of numberless personnel departments were 
thousands of samples of workers for whom there were two measure- 
ments: a cognitive ability test score of some sort and an estimate of pro- 
ficiency or productivity of some sort. Hundreds of such findings were 
published, but every aspect of this literature confounded any attempt to 
draw general conclusions. The samples were usually small, the measures 
of performance and of worker characteristics varied and were more or 
less unreliable and invalid, and the ranges were restricted for both the 
test score and the performance measure. This fragmented literature 
seemed to support the received wisdom: Tests were often barely predic- 
tive of worker performance and different jobs seemed to call for differ¬ 
ent predictors. And yet millions of people are hired for jobs every year 
in competition with other applicants. Employers make those millions 
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of choices by trying to guess which will be the best worker. What then 
is a fair way for the employer to make those hiring decisions? 

Since 1971, the answer to that question has been govemed by a land- 
mark Supreme Court decision, Griggs v. Duke Tower Co. 8 The Court 
held that any job requirement, including a minimum cutoff score on a 
mental test, must have a “manifest relationship to the employment in 
question” and that it was up to the employer to prove that it did. 9 ln 
practice, this evolved into a doctrine: Employment tests must focus on 
the skills that are specifically needed to perform the job in question. 1101 
An applicant for a job as a mechanic should be judged on how well he 
does on a mechanical aptitude test, while an applicant for a job as a clerk 
should be judged on tests measuring clerical skills, and so forth. So de- 
creed the Supreme Court, and why not? In addition to the expert testi- 
mony before the Court favoring it, it seemed to make good common 
sense. 

THE RECEIVED WISDOM OVERTURNED 

The problem is that common sense tumed out to be wrong. In the last 
decade, the received wisdom has been repudiated by research and by 
common agreement of the leading Contemporary scholars, 1111 The most 
comprehensive modern surveys of the use of tests for hiring, promotion, 
and licensing, in civilian, military, private, and government occupa- 
tions, repeatedly point to three conclusions about worker performance, 
as follows. 

1. Job training and job performance in many common occupations 
are well predicted by any broadly based test of intelligence, as com- 
pared to narrower tests more specifically targeted to the routines 
of the job. As a corollary: Narrower tests that predict well do so 
largely because they happen themselves to be correlated with tests 
of general cognitive ability . 

2. Mental tests predict job performance largely via their loading on 
g- 

3. The correlations between tested intelligence and job performance 
or training are higher than had been estimated prior to the 1980s. 
They are high enough to have economic consequences. 


We state these conclusions qualitatively rather than quantitatively 
so as to span the ränge of expert opinion. Whereas experts in employee 
selection accept the existence of the relationship between cognitive 
ability and job performance, they often disagree with each other’s nu- 
merical conclusions. Our qualitative characterizations should be ac- 
ceptable to those who tend to minimize the economic importance of 
general cognitive ability and to those at the other end of the ränge. 12 

Why has expert opinion shifted? The answer lies in a powerful 
method of Statistical analysis that was developing during the 1970s and 
came of age in the 1980s. Known as meta-analysis, it combines the te- 
sults from many separate studies and extracts broad and stable conclu¬ 
sions. 1131 In the case of job performance, it was able to combine the results 
from hundreds of studies, Experts had long known that the small sam- 
ples and the varying validities, reliabilities, and restrictions of ränge in 
such studies were responsible to some extent for the low, negligible, or 
unstable correlations. What few realized was how different the picture 
would look when these sources of error and underestimation were taken 
into account through meta-analysis. 14 Taken individually, the studies 
said little that could be trusted or generalized; properly pooled, they were 
full of gold. The leaders in this effort—psychologists John Hunter and 
Frank Schmidt have been the most prominent—launched a new epoch 
in understanding the link between individual traits and economic pro- 
ductivity. 

THE LINK BETWEEN COGNITIVE ABILITY AND JOB 
PERFORMANCE 

We begin with a review of the evidence that an important Statistical 
link between IQ and job performance does in fact exist. In reading the 
discussion that follows, remember that job performance does vary in the 
real world, and the variations are not small. Think of your own work- 
place and of the people who hold similar jobs. How large is the differ- 
ence between the best manager and the worst? The best and worst 
secretary? If your workplace is anything like ours have been, the answer 
is that the differences are large indeed. Outside the workplace, what is 
it worth to you to have the name of a first-rate plumber instead of a poor 
one? A first-rate auto mechanic instead of a poor one? Once again, the 
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common experience is that job performance varies widely, with impor- 
tant, tangible consequences for our everyday lives. 

Nor is Variation in job performance limited to skilled jobs. Readers 
who have ever held menial jobs know this firsthand. In restaurants, 
there are better and worse dishwashers, better and worse busboys. There 
are better and worse ditch diggers and garbage collectors. People who 
work in industry know that no matter how apparently mindless a job is, 
the job can still be done better or worse, with significant economic com 
sequences. If the consequences are significant, it is worth knowing what 
accounts for the difference. 

job performance may be measured in many different waysJ 151 Sometimes 
it is expressed as a natural quantitative measure (how many units a per- 
son produces per hour, for example), sometimes as structured ratings by 
Supervisors or peers, sometimes as analyses of a work sample. When these 
measures of job productivity are correlated with measures of intelli- 
gence, the overall correlation, averaged over many tests and many jobs, 
Is about >4> In the study of job performance and tests, the correlation be- 
tween a test and job performance is usually referred to as the validity of 
the test, and we shall so refer to it for the rest of the discussion. 1161 Math- 
ematically, validity and the correlation coefficient are identical. Later 
in the chapter we will show that a validity of A has large economic im- 
plications, and even validities half as large may warrant worrying about. 

This figure of .4 is no more than a point of reference. As one might 
expect, the validities are higher for complex jobs than for simple ones. 
In Edwin Ghiselli’s mammoth Compilation of job performance studies, 
mostly from the first half of the Century, a reanalysis by John Hunter 
found a mean validity of .53 for the job family labeled “manager” and 
.46 for a “trades and crafts worker.” Even an “elementary industrial 
worker” had a mean validity of .37. 17 

The Ghiselli data were extremely heterogeneous, with different stud¬ 
ies using many different measures of cognitive ability, and include data 
that are decades old. A more recent set of data is available from a meta- 
analysis of 425 studies of job proficiency as predicted by the General 
Aptitude Test Battery (GATB), the U.S. Labor Department^ cognitive 
ability test for the screening of workers. The table below summarizes the 
results of John and Ronda Hunter’s reanalysis of these databases. 18 

The average validity in the meta-analysis of the GATB studies was 
.45, U91 The only job category with a validity lower than .40 was the in¬ 


The Validity of the GATB for Different Types of Jobs 


GATB Validity for: 

%ofU.S. 


Proficiency 

Training 

Workers in These 

Job Complexity 

General job families 

High 

Ratings 

Success 

Occupations 

(synthesizing/coordinating) 

Medium 

.58 

.50 

14.7 

(compiling/computing) 

.51 

.57 

62.7 

Low (comparing/copying) 
Industrial job families 

.40 

.54 

17.7 

High (setup work) 

.56 

.65 

2.5 

Low (feed ing/offbearing) 

.23 

NA 

2.4 

Source: Hunter and Hunter 1984, Table 2. 




dustrial category of “feeding/offbearing”—putting something into a ma¬ 
chine or taking it out—which occupies fewer than 3 percent of U.S. 
workers in any case. Even at that bottom-most level of unskilled labor, 
measured intelligente did not entirely lose its predictiveness, with a 
mean validity of .23. 

The third major database bearing on this issue comes from the mili- 
tary, and it is in many ways the most satisfactory. The AFQT (Armed 
Forces Qualification Test) is extracted from the scores on several tests 
that everyone in the armed forces takes. It is an intelligence test, highly 
loaded ong. Everyone in the military goes to training schools, and every¬ 
one is measured for training success at the end of their schooling, with 
“training success” based on measures that directly assess job performance 
skills and knowledge. The job specialties in the armed forces include 
most of those found in the civilian world, as well a number that are not 
(e.g., combat). The military keeps all of these scores in personnel files 
and puts them on Computers. The resulting database has no equal in the 
study of job productivity. 

We will be returning to the military data for a closer look when we 
tum to subjects for which they are uniquely suited. For now, we will sim- 
ply point out that the results from the military conform to the results in 
the civilian job market. The results for training success in the four ma- 



74 


The Emergence of a Cognitive Elite 


The Economic Pressure to Partition 75 


The Validity of the AFQT for Military Training 

Military Job Family 

Mean Validity of 
AFQT Score and 
Training Success 

Mechanical 

.62 

Clerical 

.58 

Electronic 

.67 

General technical 

.62 

Source; Hunter 1985, Table 3. 



jor job families are shown in the table above. These results are based 
on results from 828 military schools and 472,539 military personneL 
The average validity was .62. They hold true for individual schools as 
well. Even the lowest-validity school, combat, in which training suc- 
cess is heavily dependent on physical skills, the validity was still a sub- 
stantial .45. [201 

The lowest modern estimate of validity for cognitive ability is the 
one contained in the report by a panel convened by the National Acad¬ 
emy of Sciences, Famaess in Employment Testing. 11 Tha t report concluded 
that the mean validity is only about .25 for the GATB, in contrast to 
the Hunter estimate of .45 (which we cited earlier). Part of the reason 
was that the Hartigan committee (we name it for its chairman, Yale sta- 
tistician John Hartigan), analyzing 264 studies after 1972, concluded 
that validities had generally dropped in the more recent studies. But the 
main source of the difference in validities is that the committee declined 
to make any correction whatsoever for restriction of ränge (see above 
and note 6), It was, in effect, looking at just the tackles already in the 
NFL; Hunter was considering the population at large. The Hartigan 
committee’s overriding concem, as the title of their report (Fairness in 
Employment Testing ) indicates, was that tests not be used to exclude peo- 
ple, especially blacks, who might turn out to be satisfactory workers. 
Given that priority, the committee’s decision not to correct for restric¬ 
tion of ränge makes sense. But failing to correct for restriction of ränge 
produces a misleadingly low estimate of the overall relationship of IQ 
to job performance and its economic consequences. 1221 Had the Harti¬ 
gan committee corrected for restriction of ränge, the estimates of the 
relationship would have been .35 to .40, not much less than Hunter’s. 


THE REASONS FOR THE LINK BETWEEN COGNITIVE ABILITY 
AND JOB PERFORMANCE 

Why are job performance and cognitive ability correlated? Surgeons, for 
example, will be drawn from the upper regions of the IQ distribution. 
But isn’t it possible that all one needs is “enough” intelligence to be a 
surgeon, after which “more” intelligence doesn’t make much difference? 
Maybe small motor skills are more important. And yet “more” intelli¬ 
gence always seems to be “better ” for large groups of surgeons and every 
other profession. What is going on that produces such a result? 

Specific Skills or g? 

As we begin to explore this issue, the story departs more drastically from 
the received wisdom. One obvious, commonsense explanation is that 
an IQ test indirectly measures how much somebody knows about the 
specifics of a job and that that specific knowledge is the relevant thing 
to measure. According to this logic, more general intellectual capaci- 
ties are beside the point. But the logic, however commonsensical, is 
wrong. Surprising as it may seem, the predictive power of tests for job 
performance lies almost completely in their ability to measure the most 
general form of cognitive ability, g, and has little to do with their abil¬ 
ity to measure aptitude or knowledge for a particular job. 

Specific Skills Versus g in the Military. The most complete data on 
this issue come from the armed Services, with their unique advantages 
as an employer that trains hundreds of tHousands of people for hundreds 
of job specialties. We begin with them and then turn to the corre- 
sponding data from the civilian sector. 

In assigning recruits to training schools, the Services use particular 
combinations of subtests from a test battery that all recruits take, the 
Armed Services Vocational Aptitude Battery (ASVAB).^ ^ The Penta¬ 
gon s psychometricians have tried to determine whether there is any 
practical benefit of using different weightings of the subtests for differ¬ 
ent jobs rather than, say, just using the overall score for all jobs. The 
overall score is itself tantamount to an intelligence test. One of the most 
comprehensive studies of the predictive power of intelligence tests was 
by Malcolm Ree and James Earles, who had both the intelligence test 
scores and the final grades from military school for over 78,000 air force 
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enlisted personnel spread over eighty-nine military specialries. The per- 
sonnel were educationally homogeneous (overwhelmingly high school 
graduates without College degrees), conveniently “controlling” for edu- 
cational background. 1241 

What explains how well they performed? For every one of the eighty- 
nine military schools, the answer was g—Charles Spearman’s general 
intelligence. The correlations betweengalone and military school grade 
ranged from an almost unbelievably high .90 for the course for a tech- 
nical job in avionics repair down to .41 for that for a low-skill job asso- 
ciated with jet engine maintenance.^ Most of the correlations were 
above .7. Overall, g accounted for almost 60 percent of the observed 
Variation in school grades in the average military course, once the re- 
sults were coirected for ränge restriction (the accompanying note spells 
out what it means to account for 60 percent of the observed Varia¬ 
tion”). 1201 

Did cognitive factors othet than g matter at all? The answer is that 
the explanatory power of g was almost thirty times greater than of all 
other cognitive factors in ASVAB combined. The table below gives a 
sampling of the results from the eighty-nine specialties, to illustrate the 


The Role of g in Explaining Training Success for 
Various Military Specialties 


Enlisted Military 

Percentage of Training 

Skill Category 

Success Explained by: 

Nuclear weapons specialist 

8 

Everything Else 

77.3 

0.8 

Air crew operations specialist 

69.7 

1.8 

Weather specialist 

68.7 

2.6 

Intelligence specialist 

66.7 

7.0 

Fireman 

59.7 

0.6 

Dental assistant 

55.2 

1.0 

Security police 

53.6 

1.4 

Vehicle maintenance 

49.3 

7.7 

Maintenance 

28.4 

2.7 

Source; Ree and Earles 1990a, Table 9. 




two commanding findings: g alone explains an extraordinary proportion 
of training success; “everything eise” in the test battery explained very 
little. 

An even larger study, not quite as detailed, involving almost 350,000 
men and women in 125 military specialties in all four armed Ser¬ 
vices, confirmed the predominant influence of g and the relatively 
minor further predictive power of all the other factors extracted 
from ASVAB scores. 27 Still another study, of almost 25,000 air force 
personnel in thirty-seven different military courses, similarly found that 
the validity of individual ASVAB subtests in predicting the final tech- 
nical school grades was highly correlated with the g loading of the 
subtest.^ 281 

EviDENCE FROM ClVILIAN Jobs. There is no evidence to suggest that 
military jobs are unique in their dependence on g. However, scholars 
in the civilian sector are at a disadvantage to their military colleagues; 
nothing approaches the military’s database on this topic. In one of the 
few major studies involving civilian jobs, performance in twenty-eight 
occupations correlated virtuaüy as well with an estimate of g from 
GATB scores as it did with the most predictively weighted individual 
subtest scores in the battery. 1291 The author concluded that, for 
samples in the ränge of 100 to 200, a single factor, g, predicts job 
performance as well as, or better than, batteries of weighted subtest 
scores. With larger samples, for which it is possible to pick up the 
effect of less potent influences, there may be some modest extra 
benefit of specialized weighted scores. At no level of sampling, 
however, does g become anything less than the best single predictor 
known, across the occupational spectrum. Perhaps the most surprising 
finding has been that tests of general intelligence often do better in 
predicting future job performance than do contrived tests of job 
performance itself. Attempts to devise measures that are specifically 
keyed to a job’s tasks—for example, tests of filing, typing, answering 
the telephone, searching in records, and the like for an office 
worker—often yield low-validity tests, unless they happen to measure 
g, such as a vocabulary test. Given how pervasive g is, it is almost 
impossible to miss it entirely with any test, but some tests are far more 
efficient measures of it than others, 30 
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Behind the Test Scores 

Let us try to put these data in the framework of everyday experience. 
Why should it be that Variation in general cognitive ability, g, is more 
important than job-specific skills and knowledge? We will use the job 
of busboy as a specific example, asking the question: At a runof-the- 
mill family restaurant, what distinguishes a really good busboy from an 
average one? 

Being a busboy is a straightforward job, The waiter takes the Orders, 
deals with the kitchen, and serves the food while the busboy totes the 
dirty dishes out to the kitchen, keeps the water glasses filled, and helps 
the waiter serve or clear as required. ln such a job, a high IQ is not re- 
quired. One may be a good busboy simply with diligence and good spir- 
its. But complications arise, A busboy usually works with more than one 
waiter. The restaurant gets crowded. A dozen things are happening at 
once. The busboy is suddenly faced with queuing problems, with setting 
priorities. A really good busboy gets the key Station cleared in the nick 
of time, rememhering that a table of new Orders near that particular Sta¬ 
tion is going to be coming out of the kitchen; when he goes to the 
kitchen, he gets a fresh water pitcher and a fresh condiment tray to save 
an extra trip. He knows which waiters appreciate extra help and when 
they need it. The point is one that should draw broad agreement from 
readers who have held menial jobs: Given the other necessary qualities 
of diligence and good spirits, intelligence helps. The really good busboy 
is engaged in using g when he is solving the problems of his job, and the 
more g he has, the more quickly he comes up with the Solutions and can 
call on them when appropriate. 

Now imagine devising a test that would enable an employer to 
choose the best busboy among applicants. One important aspect of the 
test would measure diligence and good spirits. Perhaps the employer 
should weigh the results of this part of the test more heavily than 
anything eise, if his choice is between a diligent and cheerful applicant 
and a slightly smarter but sulky one. But when it comes to measuring 
performance in general for most applicants, it is easy to see why the re¬ 
sults will match the findings of the literature we just discussed. Job- 
specific items reveal mostly whether an applicant has ever been a 
busboy before. But that makes very little difference to job produc- 
tivity, because a bright person can pick up the basic routine in the 
course of a few shifts. The g-loaded items, on the other hand, will 


reveal whether the applicant will ever become the kind of busboy 
who will clear table 12 before he clears table 20 because he relates the 
needed task to something that happened twenty minutes earlier 
regarding table 15. And that is why employers who want to select pro¬ 
ductive busboys should give applicants a test of general intelligence 
rather than a test of busboy skills. The kind of test that would pass 
muster with the courts—a test of job-specific skills—is a less effective 
kind of test to administer, What applies to busboys applies ever more 
powerfully as the jobs become more complex. 

DOES MORE EXPERIENCE MAKE UP FOR LESS INTELLIGENCE? 

The busboy example leads to another question that bears on how we 
should think about cognitive ability and job productivity: How much 
can experience counterbalance ability? Yes, the smart busboy will be 
more productive than the less-smart busboy a week into the job, and, 
yes, perhaps there will always be a few things that the smart busboy can 
do that the less smart cannot. But will the initial gap in productivity 
narrow as the less-smart busboy gains experience? How much, and how 
quickly? 

Separately, job performance relates to both experience and Intelli¬ 
gence, but the relationships differ. 31 That is, people who are new to a 
job learn quickly at first, then more slowly. A busboy who has, say, one 
month on the job may for that reason outperform someone who started 
today, but the one-month difference in experience will have ceased to 
matter in six months. No comparable leveling-off effect has been ob- 
served for increasing intelligence. Wherever on the scale of intelligence 
pairs of applicants are, the smarter ones not only will outperform the 
others, on the average, but the benefit of having a score that is higher 
by a given amount is approximately the same throughout the ränge. Or, 
to put it more conservatively, no one has produced good evidence of di- 

minishing returns to intelligence. 3 

But what happens when both factors are considered jointly. Uo 
employees of differing intelligence converge after some time on the 
job? If the answer were yes, then it could be argued that hirmg less in¬ 
telligent people imposes only a limited and passing cost. But the answer 
seems to be closer to no than to yes, although much remains to be 

learned. 

Some convergence has been found when SATs are used as the mea- 



80 The Emergence of a C ognitive Elite 


The Economic Pressure to Partition 81 


sure of ability and grade point average is used as the measure of achieve- 
ment. 33 Students with differing SATs sometimes differ more in their 
freshman grades than in later years. That is why President Bok granted 
predictive value to the SAT only for first-year grades. 34 On the other 
hand, the shrinking predictive power may be because students learn 
which courses they are likely to do well in: They drop out of physics or 
third-year calculus, for example, and switch to easier courses. They find 
out which professors are stingy with A’s and B’s. At the U.S. Military 
Academy, where students have very little choice in courses, there is no 
convergence in grades. 35 

When it comes to job performance, the balance of the evidence is 
that convergence either does not occur or that the degree of conver¬ 
gence is small. This was the finding of a study of over 23,000 civilian 
employees at three levels of mental ability (high, medium, and low), us- 
ing Supervisor ratings as the measure of performance, and it extended 
out to job tenures of twenty years and more. 36 A study of four military 
specialties (armor repairman, armor crewman, supply specialist, cook) 
extending out to five years of experience and using three different mea- 
sures of job performance (supervisor’s ratings, work sample, and job 
knowledge) found no reliable evidence of convergence. 37 Still another 
military study, which examined several hundred marines working as ra- 
dio repairmen, automotive mechanics, and riflemen, found no conver¬ 
gence among personnel of differing intelligence when job knowledge 
was the measure of performance but did find almost complete conver¬ 
gence after a year or so when a work sample was the measure. 38 

Other studies convey a similarly mixed pictureJ 391 Some experts are 
at this point concluding that convergence is uncommon in the ordinary 
ränge of jobsJ 401 It may be said conservatively that for most jobs, based 
on most measures of productivity, the difference in productivity associ- 
ated with differences in intelligence diminishes only slowly and par- 
tially. Often it does not diminish at all. The cost of hiring less intelligent 
workers may last as long as they stay on the job, 

TEST SCORES COMPARED TO OTHER PREDICTORS OF 
PRODUCTIVITY 

How good a predictor of job productivity is a cognitive test score com- 
pared to a job interview? Reference checks? College transcript? The an¬ 


swer, probably surprising to many, is that the test score is a better pre¬ 
dictor of job performance than any other single measure. This is the 
conclusion to be drawn from a meta-analysis on the different predictors 
of job performance, as shown in the table below. 


The Validity of Some Different Predictors 
of Job Performance 

Predictor 

Validity Predicting Job 


Performance Ratings 

Cognitive test score 

.53 

Biographical data 

.37 

Reference checks 

.26 

Education 

.22 

Interview 

.14 

College grades 

.11 

Interest 

.10 

Age 

-.01 

Source: Hunter and Hunter 1984. 


The data used for this analysis were top heavy with higher-complex- 
ity jobs, yielding a higher-than-usual validity of .53 for test scores. How- 
ever, even if we were to substitute the more conservative validity 
estimate of .4, the test score would remain the best predictor, though 
with close competition from biographical data. 41 The method that many 
people intuitively expect to be the most accurate, the job interview, has 
a poor record as a predictor of job performance, with a validity of 
only .14. 

Readers who are absolutely sure nonetheless that they should trust 
their own assessment of people rather than a test score should pause to 
consider what this conclusion means. It is not that you would select a 
markedly different set of people through interviews than test scores 
would lead you to select. Many of the decisions would be the same. The 
results in the table say, in effect, that among those choices that would 
be different, the employees chosen on the basis of test scores will on av¬ 
erage be more productive than the employees chosen on the basis of any 
other single item of information. 
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THE DIFFERENCE INTELLIGENCE MAKES 

We arrive finally at the question of what it all means. How important 
is the overall correlation of .4, which we are using as our benchmark for 
the relation between intelligence and job performance ? The temptation 
may be to say, not very. As we showed before, there will be many ex- 
cepnons to the predicted productivity with correlations this modest. 
And indeed it is not very important when an employer needs just a few 
new employees for low-complexity jobs and is choosing among a small 
group of Job applicants who have small differences in test scores. But 

rea ^’ t T departs ffom this scenario, the more important cogni- 
tive ability becomes. 

The Dollar Value of Cognitive Ability 

How much is the Variation in job performance worth? To answer that 
question, we need a measure in dollars of how much the workers in a 
given occupatton vary. (Some of the methods for making this measure- 
ment are recounted in the notes, to which we refer readers who would 
bke more detail.) To cut a long story short, think now ofa particular 
worker a sectetary, let us say. You have a choice between hiring an av¬ 
erage secretary, who by definition is at the 50th percentile, or a first-rate 
one-at the 84th percentile, let us say. If you were free to set their 
salanes at the figures you believe to reflect their true worth, how differ¬ 
ent would they be? We imagine that anyone who has worked with av¬ 
erage secretaries and first-rate ones will answer “a lot.” The consensus 
among experts has been that, measured in dollars, “a lot” works out, on 
the average, to about a 40 percent premium, 

Put more technically and precisely, one Standard deviation of the dis- 
tnbution of workers’ annual productivities in a typical occupation is 
worth 40 percent of the average worker’s annual income. I43) New work 
suggests the premium may actually be twice as large. Since the larger es- 
ttmate has yet to be confirmed, we will base our calculations on the more 
conservative estimate. To take a specific example, for a $20,000-a-year 
Job, which IS correctly priced for an average worker, the incremental 
value of hinng a new worker who is one Standard deviation above the 

for^mn 6 ^ $8 ’°°° per year ‘ 45 ’ Hirin ^ a worker 

forai20,000-a-yearjob who is one Standard deviation below the mean— 

at the 16th percentile—would cost the employer $8,000 in lost output. 
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The Standard deviation for output is usually larger for more complex 
jobs. 46 This makes intuitive sense: an assembly-line worker can do his 
job well or poorly, but ordinarily the gap that separates the proficiency 
of the 16th and 84th percentiles of assembly-line workers is not as great 
measured in the dollar value of the output as the gap that separates the 
proficiency of the 16th and 84th percentiles of engineers. But when we 
match this fact against an additional fact—that engineers make a lot 
more money than assembly-line workers—we are faced with what is 
known in statistics as an interaction effect. Getting high quality for a 
complex job can be worth large multiples of what it is worth to get 
equally high quality for a simpler job. 

We may make this concrete with some hypothetical calculations. 
Imagine a dental office, consisting of dentist and receptionist. Assume 
that the annual salary of an average dentist is $100,000 and that of the 
receptionist $25,000, and that these are correctly priced. For whatever 
reasons, society finds the dentist to be worth four times as much as the 
receptionist. l47J Suppose further that you are an employer—a Health 
Maintenance Organization (HMO), for example—who hires both dem 
tists and receptionists. By using a certain selection procedure, you can 
improve the quality of your new hirees, so that instead of hiring people 
who are, on average, at the 50th percentile of proficiency (which is what 
would happen if you picked randomly from the entire pool of recep¬ 
tionists and dentists lookingfor jobs), you instead couldhire people who 
are, on average, at the 84th percentile. What is this screening proce¬ 
dure worth to you? 

For the value of the output produced, we use a Standard deviation of 
.5 of the annual income for dentists and of .15 for that of receptionists, 
based on values actually observed. 48 The answer, given these numbers, 
is that it is worth $50,000 a year for the dentist and $3,750 per year for 
the receptionist to hire people who are one Standard deviation above 
average in proficiency—not the ratio of four to one that separates the 
dentist’s wages from the receptionist’s but a ratio of more than thirteen 
to one. ,49] 

We are not home yet, for although we know what it is worth to hire 
these more proficient dentists and receptionists, we have not yet fac- 
tored in the validity of the selection test. The correlation between test 
score and proficiency is roughly .6 for dentists and .2 for receptionists, 
again based on observation and approximating the top and bottom of 
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the ränge illustrated in the figure below, Given that Information, we 
may estimate the expected output difference between two dentists 
who score at the 50th and 84th percentiles on an intelligence test as 
being worth $30,000 a year. l50] The corresponding difference between 
two receptionists who score at the 50th and 84th percentiles in in" 
teiligence is $750 a year. And this is what we meant by an “interac- 
tion effect : the wage of the dentist is only four times that of the 
receptionist. But the value to the employer of hiring brighter dentists 
is forty times greater than the value of hiring comparably brighter 
receptionistsJ 511 

In a real-life Situation, the value of a test (or any other selection pro- 
cedure) depends on another factor: How much choice does the employer 
have? 52 There is no point in spending money on an intelligence test if 
only one applicant shows up. If ten applicants show up for the job, how" 
evet, a test becomes attractive. The figure below illustrates the eco¬ 
nomic benefit oftesting with different levels of competition for the job 
(from one to fifty applicants per job) and different tests (from a very 


The advantages of hiring by test score 


Percentage increase in productivity 



poor one with a validity of .2 to a very strong one with a validity of .6). [531 
lf everyone is hired, then, on average, the hired person is just at the av¬ 
erage level of proficiency, which is a Standard score of 0, But as soon as 
even two applicants are available per position, the value of testing rises 
| quickly. With just two applicants per position, the employer gains 16 to 

[ 48 percent in productivity, depending on the validity of the test. t54] The 

curve quickly begins to flatten out; much of the potential value of test- 
ing has already been captured when there are three applicants per job. 

I The figure above is an answer to those who claim that a correlation of, 

say, A is too small to bother with. 55 A validity of A (or even .6) may be 
unimportant if almost all applicants are hired, but even a correlation of 
f .2 (or still smaller) may be important if only a small proportion gets 

l hired. 

The M acroeconomic Costs ofNot Testing 

Since the pivotal Supreme Court decision of Griggs v. Duke Power Co. 

■ in 1971, no large American employer has been able to hire from the top 

down based on intelligence tests. Estimates vary widely for how much 
the American economy loses by not doing so, from what Hunter and 
Hunter conclude is a minimum loss of $80 billion in 1980 (and in 1980 
dollars) to what the Hartigan committee thought was a maximum loss 
of $13 billion for that year. 56 The wide ränge reflects the many impon- 
j derables in making these calculations. For one thing, many attributes of 

| an applicant other than a test score are correlated with intelligence— 

educational level, for example. Schooling captures some, but not all, of 
the predictive value of intelligence. Or consider an employer using fam- 
ily connections to hire instead of tests. A bright worker is likely to have 
: a bright sister or brother. But the average IQ score difference between 

siblings is eleven or twelve points, so, again, test scores would predict 
! proficiency better than judging an applicant by the work of a brother or 

sister. 

: Modeling the economic impact of testing has additional complexi- 

ties. It has been noted that the applicant pool would gradually get de- 
pleted of the top scorers when every successive employer tries to hire 
top down. 59 As the smart people are hired and thereby removed from 
the applicant pool, the validity of a test for those still on the job mar¬ 
ket may change because of, for example, restriction of ränge. The eco- 
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When Only the Best Will Do 


A selection ratio of one in fifty may seem unrealistic, and so it is for the 
run-of-the-mill job. But for the most competitive jobs, much higher ratios, 
up to one in several hundred, are common. Consider the handful of new 
openings in top law firms or for internships in the most desirable research 
hospitals or in the richest investment banking firms for which each year’s 
new graduates are competing. Many potential applicants select themselves 
out of the pool for those prized Jobs, realizing that the openings will be 
filled by people with stronger credentials, but they must nevertheless be 
reckoned as being part of the applicant pool in Order to get a realistic 
estimate of the importance of cognitive ability. This is again the issue 
exemplified by the weight of offensive tackles, discussed earlier in the 
chapter. 

The question arises whether the employer gains much by a rigorous se- 
lection process for choosing among the people who actually do show up at 
the job interview. Aren’t they already so high ly screened that they are, in 
effect, homogeneous? The answer is intimately related to the size of the 
stakes. When the job is in a top Wall Street firm, for example, the dollar 
value of output is so high that the difference between a new hiree who is 
two Standard deviations above the mean and one who is four Standard de- 
viations above the mean on any given predictor measure can mean a huge 
economic difference, even though the “inferior” applicant is already far 
into the top few centiles in ability. 


nomic benefit of using a test would then decline. But if testing tended 
to place the smartest people in the jobs where the test-job correlations 
are large, the spread of the productivity distributions is broad, the ab¬ 
solute levels of output value are high, and the proportions hired are 
small, the benefits could be huge, even if the economic effects of test¬ 
ing the last people in the pool are negligible. In short, figuring out the 
net effects of testing or not testing is no small matter. No one has yet 
done it conclusively. 

WHY PARTITIONING IS INEVITABLE 

To recapitulate a complex discussion: Proficiency in most common 
civilian and military occupations can be predicted by IQ, with an over¬ 


Choosing Police Applicants by IQ 


A case study of what happens when a public Service is able to hire from the 
top down on a test of cognitive ability, drawing on a large applicant pool, 
comes out of New York City. In April 1939, after a decade of economic de- 
pression, New York City attracted almost 30,000 men to a written and 
physical examination for 300 openings in the city’s police force, a selec¬ 
tion ratio of approximately one in a hundred. 57 The written test was simi- 
lar to the intelligence test then being given by the federal civil Service. 
Positions were offered top down for a composite score on the mental and 
physical tests, with the mental test more heavily weighted by more than 
two to one. Not everyone accepted the offer, but, times being what they 
were, the 300 slots were filled by men who earned the top 350 scores. Inas- 
much as the performance of police officers has been shown to correlate sig- 
nificantly with scores on intelligence tests, 58 this group of men should have 
made outstanding policemen. And they did, achieving extraordinarily suc- 
cessful careers in and out of policing. They attained far higher than aver¬ 
age rank as police officers. Of the entire group, four have been police chiefs, 
four deputy commissioners, two chiefs of personnel, one a chief inspector, 
and one became commissioner of the New York Police Department. They 
suffered far fewer disciplinary penalties, and they contributed significantly 
to the study and teaching of policing and law enforcement. Many also had 
successful careers as lawyers, businessmen, and academics after leaving the 
police department. 


all validity that may conservatively be placed at .4. The more demand- 
ing a job is cognitively, the more predictive power such a test has, but 
no common job is so undemanding that the test totally lacks predic- 
tiveness. For the job market as a whoie, cognitive ability predicts profi¬ 
ciency better than any other known variable describing an individual, 
including educational level. Intelligence tests are usually more predic¬ 
tive of proficiency than are paper-and-pencil tests that are specifically 
based on a job’s activities. For selecting large numbers of workers, there 
may be some added predictive power, usually small, when a score on a 
narrower test of performance is combined with an intelligence test. For 
low-complexity jobs, a test of motor skill often adds materially to pre- 
dictiveness. The predictive power of IQ derives from its loading on g, 
in Spearman’s sense of general intelligence. 
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If we were writing a monograph for personnel managers, the appro- 
priate next Step would be to present a handbook of tables for Comput¬ 
ing when it makes economic sense to test new applicants (ignoring for 
the moment legislative and judicial restrictions on such testing), Such 
a calculation would be based on four variables: the predictive power of 
the test for the job at hand, the Variation in worker productivity for 
the job at hand, the proportion of job applicants that are to be selected, 
and the cost of testing, The conclusion would often be that testing is 
profitable. Even a marginally predictive test can be economically im¬ 
portant if only a small fraction of applicants is to be selected. Even a 
marginally predictive test may have a telling economic impact if the 
Variation in productivity is wide. And for most occupations, the test is 
more than marginally predictive. In the average case, a test with a .4 
validity, the employer who uses a cognitive test captures 40 percent of 
the profit that would be realized from a perfectly predictive test—no 
small advantage. In an era when a reliable Intelligence test can be ad- 
ministered in twelve minutes, the costs of testing can be low—lower 
in terms of labor than, for example, conducting an interview or check- 
ing references. 

We are not writing a monograph for personnel managers, however, 
and the main point has nothing to do with whether one favors or op- 
poses the use of tests as a hiring device. The main point is rather that 
intelligence itself is importantly related to job performance. Gettingrid 
of intelligence tests in hiring — a$ policy is txying to do—will not get rid of the 
importance of intelligence . The alternatives that employers have available 
to them—biographical data, reference checks, educational record, and 
so forth—are valid predictors of job performance in part because they 
imperfectly reflect something about the applicant’s intelligence. Em¬ 
ployers who are forbidden to obtain test scores nonetheless strive to ob- 
tain the best possible work force, and it so happens that the way to get 
the best possible work force, other things equal, is to hire the smartest 
people they can find. It is not even necessary for employers to be aware 
that intelligence is the attribute they are looking for. As employers 
check their hiring procedures against the quality of their employees and 
refine their procedures accordingly, the importance of intelligence in 
the selection process converges on whatever real importance it has for 
the job in question, whether or not they use a formal test. 
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Because the economic value of their employees is linked to intelli¬ 
gence, so ultimately are their wages. Let us consider that issue in the 
next chapter, along with some others that have interlocking implica- 
tions as we try to foresee, however dimly, what the future holds for the 
cognitive elite. 




Chapter 4 

Steeper Ladders, Narrower Gates 


Cognitive partitioning through education aml occupatums will continue, and 
there is not much that the gotemnnnt or anwne eise can ein übout it. Eco¬ 
nomics will he ehe main reason. At the same time that eilte Colleges and Pro¬ 
fessional schooLs are ruming out brighter and brighter graduates , the value of 
intelligente in the market place is rising. Wages earned by penple in high-lQ 
occupations have pulled away from the wages in low-lQ occupations, and clif- 
ferences in education cannut explai n ?nasr of this change. 

Annther force /or cognitive purrinonm£ is the increasing physical Segrega¬ 
tion of the cognitive elite fron i the re.st of society. Members of the cognitive 
elite work in jobs that usually keep them off the shopjhor, away from the con- 
structum site, and dose to others who also tend to be smart. Computers and 
electronic communic‘utio?i ?nake it incrcosiTi^/v likely that people who work 
mainly teith their rninds collaborate only with other such people. The isolation 
of the cognitwe elite is compowuled by its choices of where to live, shop, play , 
u.'ors/ii/), und soid its chiklren to school. 

Its isolation is inten.si/ied by an irony of a mobile and democratic society 
like America’s . CCgnime ability is a function of both genes and enviromnent, 
u>ith imphcation.s for egalitär um social policies . The more we sueceed in giv- 
ing every yourigster a chance to develop his or her latent cognitive ability, the 
more uv equalize the environmental sources of differences in mtelligence. 
The irony is that as America eqmlizes the circmnstances of people s lives , 
the rernaming differences in intclligence are increasmgN determined hy dif¬ 
ferences in gares. Meaniehile , high cognitive ahihty me ans , more than ever 
before , that the chances of success in life are good and getting better all the 
time. Putting it all together , success and failure in the American economy, 
and all that goes with it, are incretisingly a matter of the genes that people 
inherit. 

Add to this the phenomenon knoten as assortative matmg. Likes attract 
when it comes to marriage, and intclligence is one of the most important of 




92 The Emergence oj a tr\ ! c 5 Eine 


Steeper Ladders, Nanoiver Ciat es 93 


those likes. When this propensity to mate by IQ is combined with increasingly 
efficient educational and occupational stratification , assomuhv matmg by IQ 
ha$ more powerful effects an rhe next generanon than it had an the previous 
cme. This process too seems to he getting stronger, pan oj che brew creating 
an American dass System. 


A s Mae West said in anorher eontext, goodness has norhing to du 
with it. We are not talking abour what should have been but 
what has been. The educational System does sort by cognitive ability 
at the close of thc twentieth Century in a way that it did not at the 
open ing of the Century. The upper st rata of intelligence are being 
sucked into a compamtively few occupations in a way that they did not 
used to he. Cognitive ability is imporrantly related to job productivitv. 
All of rhese trends will continue under any social policy. We are Op¬ 
timist ic enough to believe that no adininistration, Left or Right, is 
going to impede the education of the hrightest, or forbid the hrightest 
from entering the most cognitive ly demanding occupations, or find 
a way to keep employers from rewarding produetivtty. But we are 
not so optimistic that we can overlook dark shadows accompanying 
the trends. 

To this point, we have avovded saying what social consequences 
might be expected. This omission has been deliberate, for part of a can- 
did answer must be, “We aren’r sure.“ We can be sure only that the trends 
are important. Cognitive stratification as a central social prucess is 
something genuinely new under the sun. One of our purposes is to bring 
it to public attention, hopeful that wisdom will come from encouragmg 
more people to think about it. 

It is impossible to predict all the ways in which cognitive strati- 
fication will internet w r ith the workings of an American democracv that 
is in flux. We do have some thoughts on the matter, however, and in 
this chapter use the available scienrific data to peer into the future. The 
data center on the dynamics that will make cognitive stratification more 
pronounced in the years to come—rhe differences greater, the over- 
lap small er, the Separation wider. We reserve our large r speculations 
about the social consequences tor Chapters 21 and 22. 


THE CHANGINC MARKET FOR ABILITY 

The overriding dynamic that will shape the effects of cognitive stratifi¬ 
cation is thc increasing value of intelligence in the markerplace. The 
smart ones are not only heing recruited to College more efficiently, they 
are not only (on average) more productive in the workplace, their dol- 
lar value to employers is increasing and there is every reason to believe 
that this crend will continue. As it does st), rhe economic gap separat- 
ing the upper cognitive classes from the rest of society will increase. 

The general shape of what has been happen ing is shown in the fig- 
ure for a representative higlvlQ occupation, engineering, compared to 
the average manufacturing employee, starting hack in 1932. As always, 
dollar figures are expressed in 1990 Jollars. The 1950s turn out to have 
been the decaele of hidden revolution tor income, just as it was for ed¬ 
ucation and Status. Throughout the 1930s and 1940s, the average engi- 
neer and the average manufacturing employee remained in roughly a 
constanr economic re lat ionship, even con verging s light ly. Then from 


Engineers’ salaries as an example of how intelligence 
became much more valuable in the 1950s 
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ustics 1989, Tables 80, 106. 




94 The Emergence of a Cogrutive Elite 


Sreeper 1 Widders, Nmrower Gates 95 


Wert' the 1980s Good or Bad for Income? 


The re are half a dozen different ways to view rhe ecouomy Jur ing the 
1980s. Because most of it feil in Ronald Reagans presiJency, an intense 
polirical srruggle to characterize the d ec ade as econom ieally “good" or 
“baJ" has ensued. The inain sourcc of confusion lies in the distinction he- 
tween hou.sehold income, which wenr up for all income groups, driven hy 
the increase in rwivincome families and low unemployment, and real 
wagen, which (generally) rose for whire-eollar workcrs and teil for hlue-coh 
lar workers. There are also contusions rhar arise hecause rhe value of hon- 
efit. packages rose even rhough cash wages did not and hecause of 
controversies over rhe proper calcularion of changes in real purchasing 
power. We will not try to adjudicate rhese issues or the role that President 
Reagan's economic policies playeJ, which have taken vvhole hooks to ar- 
gue out. 


1953 to 1961 the average engineers salary nearly doubled while the 
manufacturing employees salary* followed the same gradually rising 
trend and increased hy only 20 percent. By rhe end of the 1980s, the av¬ 
erage manufacturing employee had to get hy on ahout $2 5,000 a year 
while rhe engineer made an average of $72,000. The difference in their 
purchasing power had tripled since the 1940s, which is enough to put 
them in separate economic hrackets. 

The comparison hetween engineers and manufacturing employees is 
a mierocosm of what has happened generally to American workers. Us- 
ing data from the Current Population Surveys, economists Lawrence 
Katz and Kevin Murphy, among others, have estahlished that from 1965 
to 1987, male workers making the highest 10 percent of wages enjoyed 
a rise of ahout 40 percent, while the real wages ofthose at the eorre- 
sponding low end were close to static. 1 

We upened the chapter hy asserting that cognitive ahility has heen a 
key factor in this process. Next we look at the reasons for this conclu- 
sion. 

The Role of Education 

The Standard way of interpreting the figure for engineers and manufac- 
turing is to talk ahout education. Düring the last quarter-century, real 
wages rose more rhan twice as much for workers with College educations 


than for those with high school or less. 121 Trends were not uninterrupted 
with in the interval. Following the huge expansion of the post-World 
War II College population, it seemed for a while that the economic hen- 
efits of education were beingswamped by oversupp ly, as wages feil dur- 
ing the 1970s for college-educated people.* But in the 1980s, the trend 
reversed. Real wages for highly educated people started once again to 
climh and wages feil for those with twelve or fewer years of school ing. 1-41 

The table below gives the percentage change in real wages for full- 
time male workers 1 ’' 1 at three educational levels during the 1980s, Bro¬ 
ken out hy whether they are new workers (one to five or rwenty-six to 
thirtydive years of work experience). The dramatic changes occurred 


Education, Experience, and Wages, 1979-1987 


New workers ( 1-5 years oj experience) 
Less rhan 1 2 years of school 
High school Jegree 
16 or more years of school 


Percentage Change 
in Wages 

-15.8 
-19.8 
+ 10.8 


Old workers (26-35 years of experience) 

Less rhan 12 years of school -1.9 

High school Jegree -2.8 

16 or more years of school + 1.8 


SourcY: Adupted trom Katz and Murphy, 1990, Fable I. 


among young men just coming mto the labor market. Hhgh school grad- 
uates and dropouts saw their real wages plunge, while young men with 
College educations enjoyed a healthy increase. H Meanwhile, ex per i- 
enced older men saw little real change in income whatever their level 
of education. Why rhe difference hetween rhe age groups? lnterpre- 
tively, wages fc^r men with many years of experience reflect their work 
history as well as their immediate economic value. Wages tor people just 
enfering rhe labor force are more purely an expressionof prevailing mar¬ 
ket forces. The joh market reevaluated schooling during the past lwo 
decades: Educated workers, having heen devalued in the 1970s, hecame 
increasingly valuable in the 1980s, in comparison with less educated 
workers. 1 ' 1 
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Why have rhe economic returns to education lately risen, rherehy 
widening the income gap berween the educated and rhe uneducated? 
Perhaps, say some commentators, the wage inequality prohlem is tech- 
nological, as machines displace people from low-skill johs. Perhaps 
schools are failing to teach people skills thar they used to teach, or mayhe 
the schools are doing as well as ever hut the blue-collar johs that require 
only lowdevel skills are emigrating to countries where lahor is cheaper, 
thereby creating an oversupply of less educated workers in America. Per' 
haps the welfare System is eroding the need to vvork among the low-skill 
population, or the weakening lahor unions are not protecting their eco' 
nomic interesrs, or a declining real minimum wage is letting the wage 
structure sag at the low end. 

These possihilities all hear on a crucial issue: Hou' muc/t good would u 
do to improve education for the people eaining low wages- If somehow the 
government can cajole or entice youths to stay in school tor a few ex- 
tra years, will their economic disadvantage in the new lahor market go 
away? We douht it. Their disadvantage might he diminished, hut only 
modestly. The re is reason to rhink rhat the joh market has heen re' 
warding not just education hut intelligence/ 

The M ysterious Residual 

The indispensable datahase for analy:mg wages over time is the Cur¬ 
rent Population Survey, the monrhly national survey conducted hv ihe 
Bureau of rhe Census and the Bureau of Labor Statistics, which asks peo¬ 
ple only about their years of education, not their IQs. Rut as the So¬ 
phist icated Statistical analyses of wage Variation have accumulated, 
experts have come to agree that something heyond education, gender, 
and experience has been at work to increase income dispariries in re- 
cent times/The spread in real wages grew hetween 1963 and 198? even 
after taking those other factors into aceount. |ul The economic term tor 
this unexplained Variation in wages is “the residual." 1111 

To understand the growing wage inequality requires an account 
of this residual Variation. Residual wage Variation for both men and 
women started rising in about 1970 and seems still to be rising. Among 
economists, there is a consensus that, whatever those residual charac- 
teristics consist of, it has been mainly the demand for them, not their 
supply , that has heen changing and causing increasing wage inequality 
for a generation, with no signs of ahatingT Despite rhe public locus on 


the increasing imporrance of education in the workplace, mosr of the 
increasing wage inequality during rhe past two and a half decades is due 
to changes in the demand for the residual characteristics of workers 
rather than to changes in the demand for education or experience. n The 
job market for people lacking the residual characteristics declined, while 
expanding for people having them. 

The Ca.se for IQ as the Residual 

What then is this residual, this X factor, that increasingly commands a 
wage premium over and above education? It could he a variety of fac- 
tors. It could he rooted in diligence, amhition, or sociability. lMl It could 
he associated with different industries or different firms within indus- 
tries, or different wage norms (e.g., regional variations, variations in 
merit pay), again insofar as they are not accounted for hy the measured 
variables. Or ii could he cognitive ahility. Conclusive evidence is Ward 
to come hy, hut readers will not he surprised to leam that we believe 
that it includes cognitive ahility. There are several lines of support for 
this hypothesis. 

As a first cut at rhe prohlem, the changing wages have something to 
Jo with the shifting occupational structure of our economy. High-sta- 
tus, and therefore relatively high-paying, johs are tipped toward people 
with high intelligence, as Chapter 2 showed. As the higlvend johs have 
become more numerims, demand must rise for t he intellectual abilities 
that they require. When demand rises for any good, including intellb 
gence, the price (in this case, the wages) goes up. Purely on economic 
grounds, then, wage inequality grew as the economic demand for inteb 
ligence climbed. 

We further know from the data discussed in Chapter 3 that cognb 
tive ahility affects how well workers ar all levels do their johs. It smarter 
workers are, on average, better workers, there is reason to believe 
that income within joh categories may he correlated with intelli- 
gence. 

Still furrher, we know that the correlation hetween intelligence and 
income is not much diminished hy partialing out the contributions of 
education, work experience, niarital Status, and other demographic vari¬ 
ables. 1 ^ Such a finding strengthens the idea that the job market is in- 
creasingly rewarding not just education but intelligence. 

Final ly, McKinley Blackburn and David Neumark have provided 
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direct evidence in their analysis of white men in rhe National Longitm 
dinal Survey of Youth (NLSY). Education and intelligence each con- 
tributed to a worker’s income, hut the smart men carned most of the 
extra wage benefit of education during the past dea nie. 1161 The growing 
economic benefits of either schooling or intelligence are dispropor- 
tionately embodied in the rising income of educated people with high 
test scores and in the falling wages earned by less educated people with 
low scores. 11,1 

This premium for IQ applies even within the higlvIQ occupations 
that we discussed in Chapter 2. In the NLSY, among people holding one 
of these jobs, the 1989 weekly eamings (expressed in 1990 dollars) of 
those in the top 10 percent of IQ were $977, compared to $697 for those 
with IQs below the top 10 percent, for an annual income difference of 
over $14,000. llrtJ Even after extracting any effects of their specific occm 
pations (as well as of the differing incomes of men and women), heing 
in the top 10 percent in IQ was still worth over $11,000 in income for 
those in this Collection of prestigious occupations. 

Why Cognitive Ahility Has Become More Valuable to Employers 

This brings us as far as the data on income and intelligence go. Betöre 
leaving the ropic, we offer several reasons why the wage premium for 
intelligence might have increased recently and may he expected to con- 
tinue to increase. 

Perhaps most obvtously is that technology has increased the eco¬ 
nomic value of intelligence. As robots replace factory workers, the iac- 
tory workers’ jobs vanish, but new jobs pop up for people who can design, 
program, and repair robots. The new jobs are not necessarily going to 
be filled by the same people, for they require more intelligence than the 
old ones did. Today s technological frontier is more complex than yes- 
terday’s. Even in traditional Industries like retailing, banking, mining, 
manufacturing, and farming, management gets ever more complex. The 
capacity to understand and manipulate complexity, as earlier chapters 
showed, is approximated by g, or general intelligence. We would have 
predicted that a market economy, faced with this turn of events, would 
soon put intelligence on the sales block. 1t has. Business consultancy is 
a new profession that is soaking up a growing fraction of the graduares 
of the elite Business schools. The Consultants seil mainly their trained 
intelligence to the businesses paying their huge fees. 


A second reason involves the effects of scale, spurred by the growth 
in the size of corporations and markets since World War 11. A person 
who can dream up a sales campaign worth another percentage point or 
two of market share will he sought after. What “sought after” means in 
dollars and Cents depends on what a point of market share is worth. If 
it is worth $500,000, the market for his Services wil I produce one ränge 
of salaries. If a point of market share is worth $5 million, he Ls much 
more valuable. If a point of market share is worth $100 million, he is 
worth a fortune. Now consider that since just 1960, rhe average annual 
sales, per Corporation, of Americas five hundred largest industrial cor¬ 
porations has jumped from $1.8 hillion to $4.6 billion (hoth figures in 
1990 dollars). The same gigantism has affected the value of everything 
from the ability to float successful bond offerings to rhe ahility to nego- 
tiate the best prices for volume purchases hy huge retail ehains. The 
magnitude of the economic consequences of ordinary husiness transac.- 
tions has mushroomed, and with it the value of people who can do their 
work at a marginally higher level of skill. All the evidence we have sug- 
gests that such people have, among their other characteristics, high in¬ 
telligence. There is no reason to think that this prooess will stop soon. 

Then there are the effects of legislation and regulation. Why are cer- 
rain kituls of lawyers who never see the inside of a courtroom ahle to 
command such large fees? ln many cases, hecause a first-rate lawyer can 
make a difference worth tens of millions of dollars in getting a favorahle 
decision from a government agency or slipping through a tax loophole. 
Lawyers are not the only beneficiaries. As the rules of the game gov¬ 
erning private enterprise become ever more labyrinthine, intelligence 
grows in value, sometimes in the most surprising places. One of our coh 
leagues is a social psychologist who Supplements his university salary hy 
serving as an adviser on jury selection, at a Consulting fee of several thou- 
sand dollars per day. Based on his track record, his advice ratses the prob- 
ahility of a favorahle verdict in a liability or patent dispute hy ahout 5 
to 10 percent. When a verdict may represent a swing ot $100 million, 
an edge of that size makes him well worth his large fee. 

We have not exhausted all the reasons that cognitive ahility is 
becoming more valuable in the labor market, but these will serve to 
i Uust rate the the me: The more complex a society becomes, the more 
valuable are the people who are especially good at dealing with conv 
plexity. Barring a change in direction, the future is likely to see the rules 
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for doing Business become yet more complex, to see regulari-on extend 
still further, and to raise still higher the stakes tor having a high IQ. 

The End Result: Prosperity for Those Lncky Enough to Be Intelligent 

After all that has gone betöre, it will come as no surprise to find that 
smart people tend to have high incomes. The advantage enjoyed hy 
those who have high enough lQs to get into the high-IQ oeeupations 
is shown in the figure below. All of the high-IQ oeeupations have me¬ 
dian wages well out on the right-hand side of the distribution.Those 

The high-IQ oeeupations also are well-paid oeeupations 


The Recent American Wage Distribution 



Source; U.S. Department ot Lahor Wl. 


in the top ränge of IQ had incomes that were eonspieuoiisly above those 
with lower lQs even within the high-IQ oeeupations. The overail me¬ 
dian family ineome with a member in one of these oeeupations and with 
an IQ in the top 10 percent was $61,100, putting them at the 84th per- 
centile of family ineomes for their age group. These fortunate people 
were newly out of graduate school or law sehool or medical sehool, still 
near the bottom of their earnings trajectory as of their early thirnes, 
whereas a large proportion of those who had gone into hlue-eollar jobs 
(disproportionarely in the lower IQ deciles) have mueh less room to ad- 


Ineome as a Family Trait 


America has taken great pride in the mohility of generations: enterprising 
children of poor families are supposed to do better than their parents, and 
the wastrei children of the rieh are supposed to fritter away the family for- 
tune. But in modern America, this mobility has its limits. The experts now 
helieve that the correlation between fathers’ and sons' ineome is at least 
.4 and perhaps c loser to .5. |21| Think of it this way: The son of a father whose 
earnings are in the bottom 5 percent of the distribution has something 1 ike 
one chance in twenty (or less) of rising to the top fifth of the ineome dis¬ 
tribution and almost a fifty-fifty chance of staying in the bottom fifth. He 
has less than one chance in four of rising above even the median ineome. 22 
Economists search for explanations of this phenomenon in struetural fea- 
tures of the economy. We add the element of intellectual stratification. 
Most people at present are stuck near where their parents were on the in- 
come distribution in part because IQ, which has become a major predic- 
tor of ineome, passes on sufficiently from one generation to the next to 
constrain economic mobility. 


vance beyond this age. 1201 In other words, the occupational elite is pros- 
perous. Within it, the cognitive elite is more prosperous still. 

COGNITIVE SORTING THROUGH PHYS1CAL SEPARATION 

The effects of cognitive sorting in education and occupation are reified 
through geography. People with similar cognitive skills are put together 
in the workplace and in neighhorhoods. 

Cognitive Segregation in the Workplace 

The higher the level of cognitive ability and the greater the degree of 
homogeneity among people involved in that line of work, the greater is 
the degree of Separation of the cognitive elite from everyone eise. First, 
consider a workplace with a comparatively low level of cognitive ho¬ 
mogeneity—an industrial plant. In the physical confines of the plant, 
all kinds of abilities are being called upon: engineers and machinists, 
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electricians and pipefitters and sweepers, foremen and shift Supervisors, 
and the workers on che loading dock. The shift Supervisors and engi- 
neers may have offices that give them some physical Separation from the 
plant floor, but, as manufacturers have come to realize in recent years, 
they had hetter not spend all their time in those offices. Efficient and 
profitable production requires not only that very different tasks he ac- 
complished, using people of every level ot cognitive ability, but that they 
be accomplished cooperatively. If the manufacturing Company is pros- 
pering, it is likely that a fair amount ot daily intermingling of cognitive 
classes goes on in the plant. 

Now we move across the Street to the Companys office building. Here 
the average level of intelligence is higher and the spread is narrower. 
Only a handful of jobs, such as janitor, can be performed by people wich 
low cognitive ability. A number of jobs can be done by people of aver- 
age ability—data entry clerks, for example. Some jobs that can he done 
adequately by people with average cognitive ability tum into virtually 
a different, and much more important, sort ot job if done superbly. The 
job of secretary is the classic example. The traditional executive secre- 
tary, rising through the secretarial ranks until she takes charge of the 
boss’s office, was once a familiär career parh for a really capable, no doubt 
smart, woman. For still other jobs, cognitive ability is important hut less 
important than other talents—among the sales representatives, tor 
example. And finally there is a layer of jobs among the senior execu- 
tives and in the R&D department for which cognitive ability is im¬ 
portant and where the mean IQ had better be high if the Company 
is to survive and grow in a competitive industry. In the office building, 
not only cognitive homogeneity has increased; so has physical Separa¬ 
tion. The executives do not spend much time with the janitors or the 
data entry clerks. They spend almost all their time interacting with 
other executives or with technical specialists, which means with 
people drawn from the upper portion of the ability distribution. 

Although corporate offices are more stratified for intelligence than 
the manufacturing plant, some workplaces are even more stratified. Let’s 
move across town to a law firm. Once again, the mean IQ rises and the 
Standard deviation narrows. Now there are only a few job categories— 
for practical purposes, three: secretaries, paralegals or other forms of le¬ 
gal assistants, and the attorneys. The lowest categories, secretarial and 
paralegal work, require at least average cognitive skills for basic com- 
petence, considerably more than that if their jobs are to be done as well 


as they could be. The attorneys themselves are likely to be, virtually 
without exception, at least a Standard deviation ahove the mean, if only 
because of the selection procedures in the law schools that enabled them 
to become lawyers in the first place. It remains true that part of the suc- 
cess of the law firm depends on qualities that are only slightly related to 
cognitive skills—the social skills involved in getting new business, for 
example. And attorneys in almost any law firm can be found shaking 
their heads over the highly paid (and smart) partner who is coasting on 
his subordinates’ talents. But the overall degree of cognitive stratifica- 
tion in a good law firm is extremely high. And note an important dis- 
tinction: It is not that stratification mithin the law firm is high; rather, 
ehe entire workplace represents a Stratum highly atypical of cognitive 
ability in the population at large. 

These rarefied environments are becoming more common because 
the jobs that most demand intelligence are increasing in number and 
economic importance. These are jobs that may be conducted inclois- 
tered settings in the Company of other smart workers. The brightest 
lawyers and bankers increasingly work away from the courtroom and the 
bank floor, away from all excepr the most handpicked of corporate 
clients. The brightest engineers increasingly work on problems that 
never require them to visit a construction site or a shop floor. They can 
query their Computers to get the answers they need. The brightest pub¬ 
lic policy specialists shuttle among rhink tanks, hureaucracies, andgrad- 
uate schools of public policy, never having to encounter an angry voter. 
The brightest youngsters launch their careers in business by getting an 
M.B.A. from a top business school, thence to climb the corporate lad- 
der without ever having had to seil soap or whatever to the Company’s 
actual customers. ln each example, a specialized profession with in the 
profession is developing that looks more and more like academia in the 
way it reeruits, insulares, and isolates members of the cognitive elite. 

R esidential Segregation 

As soon as a town grows larger than a few dozen households in size, it: 
Starts to develop neighborhoods. As towns become cities, this tendency 
becomes a reliable law of human communities. People seek out com- 
fortahle neighborhoods they can afford. For some people, this will mean 
looking for a particular kind of setting. Parents with young children typ- 
ically want parks, good schools, and neighbors with young children. Sin- 
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gle people in their twenties and rhirties makinggood money often grav- 
itat.e toward upscale urban neighborhoods with lots of places to go and 
things to do. 

The result is to produce neighborhoods with a high level ot socio- 
economic partitioning. The factory worker seldom lives next door to 
ehe executive, and this was as tme in 1900 as in the last years of the 
Century. The wealthy people have always been the most mobile. Hut in 
the late twentieth Century, the most mobile people are increasingly 
drawn from the cognitive elite. In thinking about these changes, we will 
focus on their implications for the way that the children of the cogni- 
tive elite are raised, for therein lies one of the main potential sources of 
trouble. 

First, the urhanization of the nation has meant that a much smaller 
Proportion of the population grows up in places where socioeconomic 
mixing occurs natural ly. Given a small enough town, the re are not 
enough elementary schools to segregate the children efficiently. The 
children of the local upper crust may live on the Street with the large 
houses, hur there are not enough of them to fill up a whole school. Af¬ 
ter elementary school, every child in the town goes to the same middle 
school and high school. Such towns now constitute a shrinking Pro¬ 
portion of the population, however. As of 1990, 78 percent of the over- 
all population lived in metropolitan areasQ 

Cognitive Segregation is also being intensified by failures of gov- 
ernment in large cities. As urban school Systems deteriorate, people 
with money relocate to rieh suburbs because that is where the good 
public school Systems are; if they remain in the city, they send their 
children to private schools, which are even more homogeneous. 
As crime rates rise, people with money relocate to suburbs where the 
crime rates are low, or they concentrate ever more densely within 
the safer parts of the city. As urhan tax rates rise, the middle dass 
flees, leaving behind even more starkly segregated poles of rieh and 
poor. 

Bright working-class youngsters mix with children of every other 
level of ability in elementary school, hut they are increasingly likely to 
be drawn away to the more intellectually homogeneous high school 
courses, thence to College. Much of the cognitive talent that used to he 
in the working-class neighborhood is heing whisked up and out of the 
community through an educational System that is increasingly driven 
by academic performance. Because of residential Segregation, the chil- 


dren ot lawyers, physicians, College protessors, engineers, and business 
executives tend to go to schools with each other’s children, and seldom 
with the children of cah drivers or assemhly-line workers, let alone with 
the children of weif are recipients or the chronically unemployed. They 
may never go to school with children representative of the whole ränge 
of cognitive ability. This tendency isexacerbated by another force work¬ 
ing in the background, genes. 

GENETIC PARTITIONING 

Twenty years ago, one of us wrote a book that created a stir because it 
discussed the heritahility of IQ and the relationship of intelligence to 
success in Life, and foresaw a future in which socioeconomic Status would 
increasingly be inherited. The logic of the argument was couched in a 
syllogism: 

• lf differences in mental abilities are inherited, and 

• lf success requires those abilities, and 

• Ii earnings and prestige depend on success, 

• Then social Standing (which reflects earnings and prestige) will be 
based to some extent on inherited differences among people .“' 1 

As stated, the syllogism is not fearsome. lf intelligence is only trivially 
a matter of genes and if success in lifo is only trivially a matter of intel- 
ligence, then success may be only trivially inherited. 

Hou> Much ls IQ a Mauer of Genes? 

ln fact, IQ is substantially heritable. The state of knowledge does not 
permit a precise estimate, but half a Century of work, now amounting to 
hundreds of empirical and theoretical studies, permits a broad conclu- 
sion that the genetic component of IQ is unhkely to be smaller than 40 
percent or higher than 80 percent Q The most unambiguous direct es- 
timates, based on identical twins raised apart, produce some of the high- 
est estimates of heritahility. 2 ^ For purposes of this discussion, we will 
adopt a middling estimate of 60 percent heritahility, which, by exten- 
sion, means that IQ is about 40 percent a matter of environment. The 
balance of the evidence suggests that 60 percent may err on the low side. 
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Because IQ and genes Was been such a sensitive topic, it is worth a 
short digression to give some idea of where these estimates come from 
and how rrustworthy they are. 

First, consider the question that heads this section, not its answer. 
What we want to know is how mach of the Variation in IQ in a popula - 
tion —the aggregated differences among the individuals 1 " 1 —is due to 
variations in genetic endowments and how much is due to variations in 
environment. If all the population Variation in IQ is due to variations 
in environment, then the /umtabi/ity is 0; ( ‘ :8f if half is due to environ¬ 
mental variations, ir is .5; if none is due to environmental variations, it 
is 1.0. Heritahility, in other words, is a ratio that ranges between 0 and 
1 and measures the relative contribution of genes to the Variation oh- 
served m a trait. 

Specialists have come up with dozens of procedures for estimating 
heritahility. Nonspecialists need not coneern themselves wirb nuts and 
holts, but they may need to he reassured on a few basic points. First the 
heritahility of any trait can be estimated as long as its Variation in a pop¬ 
ulation can he measured. IQ meets that criterion handily. There are, in 
fact, no other human traits—physical or psychological—that provide as 
many good data for the estimation of heritahility as the IQ. Second, her- 
itahility describes something about a population of people, not an indi¬ 
vidual. It makes no more sense to talk about the heritahility of an 
individual $ IQ than it does to talk ahout bis birrhrate. A given indi- 
vidual's IQ may have been greatly affected by his special circumsrances 
even though IQ is substantially heritable in the population as a vvhole. 
Third, the heritahility of a trait may change when the conditions pro- 
ducing variarion change. If, one hundred years ago, the variations in ex- 
posure to education were greater than they are now (as is no douht the 
ca.se), and if education ls one source of Variation in IQ, then, other 
things equal, the heritahility of IQ was lower then than ir is now. 

This last point is especially important in the modern socieries, wirh 
their intense efforts to equalize opportunity. As a general rule, as Envi¬ 
ronments become more uniform , heritainlity rises. When heritahility rises, 
children resemble their parents more, and siblings increasingly resem- 
ble each other; in general, Family members become more similar to each 
other and more different from people in other families. It is the central 
irony of egalitarianism: Uniformity in society makes the members of 
families more similar to each other and members of different families 
more different. 


Now for the answer to the question, How much is IQ a matter of 
genes? Heritahility is estimated from data on people with varying 
amounts of genetic overlap and varying amounts of shared environ¬ 
ment. Broadly speaking, the estimates may be characrerized as direct or 
indirectQ Direct estimates are based on samples of blood relatives who 
were raised apart. Their genetic overlap can he estimated from Basic ge¬ 
netic considerations. The direct methods assume that the correlations 
between them are due to the shared genes rather than shared environ- 
ments because they do not, in fact, share environments, an assumption 
that is more or less plausible, given the particular conditions of the study. 
The purest of the direct comparisons is based on identical (monozygoric, 
MZ) twins reared apart, often not knowing of each others existence. 
Identical twins share all their genes, and if they have been raised apart 
since birth, then the only environment they shared was that in the 
womb. Except for the effects on their lQs of the shared uterine envi- 
ronment, their IQ correlation directly estimates heritahility. The most 
modern srudy of identical twins reared in separate homes suggests a her- 
itability for general intelligence between .75 and .80, a value near the 
top of the ränge found in the Contemporary technical literature. 11 Other 
direct estimates use data on ordinary siblings who were raised apart or 
on parents and their adopted-away children. UsuaLly, the heritahility es¬ 
timates from such data are lower hut rarely below .4. 1 ^ 1 

Indirect methods compare the IQ correlations between people with 
different levels of shared genes growing up in comparable environ- 
ments—siblings versus half-siblings or versus cousins, for example, or 
MZ twins versus fratemal (dizygoric, DZ) twins, or nonadoptive siblings 
versus adoptive siblings. The underlying idea is that, for example, if full 
siblings raised in the same home and half-siblings raised in the same 
home differ in their IQ correlations, it is because they differ in the Pro¬ 
portion of genes they share: full siblings share about 50 percent of genes, 
half siblings about 25 percent. Similarly, if siblings raised in unshared 
environments and cousins raised in unshared environments differ in 
their IQ correlations, ir is because of the differing degrees of genetic 
overlap between cousins and siblings and not because of differing envi¬ 
ronmental influences, which are unshared by definition. And so on. 
Fleshed out in some sort of Statistical model, this idea makes it possible 
to estimate the heritahility, hut the modeling can get complex. Some 
studies use mixtures of direct and indirect methods. 1 *' 1 

The technical literature is filled with varying estimates of the heri- 
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tability of IQ, owing to the varying models being used for estimation 
and to the varying sets of data. Some people seem eager to rhrow up 
their hands and deelare, “No one knows (or can know) how heritahle 
IQ is.” But that reaction is as unwarranted as it is hasry, if one is eon- 
tent, as we are, to accept a ränge of uncertainty about the heritability 
that specialists may find nerve-racking. We are contenr, in other words, 
to say that the heritability of IQ falls somewhere within a broad ränge 
and that, for purposes of our discussion, a value of .6 ±.2 does no vio- 
lence to any of the competent and responsible reeent estimates. The 
ränge of .4 to .8 includes virtually all reeent (since 1980) estimates— 
competent, responsible, or otherwise. |MJ 

Reeent studies have uncovered other salient facts about the way IQ 
scores depend on genes. They have found, for example, that the more 
general the measure of intelligence—the closer it is to g—the higher is 
the heritability. h Also, the evidence seems to say that the heritability 
of IQ rises as one ages, all the way from early childhood to late adult- 
hoodd 6 This means that the Variation in IQ among, say, youths ages 18 
to 22 is less dependent on genes than that among people ages 40 to 44- 1 '' 1 
Most of the traditional estimates of heritability have been based on 
youngsters, which means that they are likely to underestimare the role 
of genes later in life. 

Final ly, and most surprisingly, the evidence is growing that whatever 
Variation is left over for the environment to explain (i.e., 40 percent of 
the total Variation, if the heritability of IQ is taken to be .6), relatively 
little can be traced to the shared environments created by familiesQ It 
is, rather, a set of environmental mfluences, mostly unknown at present, 
that are experienced by individuals as individuals. The fact that family 
members resemble each other in intelligence in adulthood as much as 
they do is very largely explained by the genes they share rather than the 
family environment they shared as children. These findings suggest deep 
roots indeed for the cognitive stratification of society. 

The Syllogism in Practice 

The heritability of IQ is substantiaL In Chapters 2 and 3, we presented 
evidence that the relationship of cognitive ability to suecess in life is far 
from trivial. Inasmuch as the syüogism’s premises cannot he dismissed 
out of Hand, neither can its conclusion that success in life will be based 
to some extent on inherited differences among people. 1 ' 9 * 


Furthermore, a variety of other scientific findings leads us to conclude 
rhat the heritability of success is going to increase rather thandiminish. 
Regin with the limits that heritability puts on the ability to manipulate 
intelligence, by imagining a United States that has magically made good 
on the Contemporary ideal of equality. Every child in this imaginary 
America experiences exactly the same environmental effects, for good 
or ill, on bis or her intelligence. How much intellectual Variation would 
remain? If the heritability of IQ is .6, the Standard deviation of IQ in 
our magical world of identical environments would be 1 1.6 instead of 
15 (see the note for how this calculation is done)—smaller, hut still leav- 
ing a great deal of Variation in intellectual talent that could not be re- 
duced further by mere equalization. 1401 As we noted earlier, when a 
society makes good on the ideal of letting every youngster have equal 
access to the rhings that allow latent cognitive ability to develop, it is 
in effect driving the environmental component ol IQ Variation closer 
and closer to nil. 

The United States is still very far from this state of affairs at the ex¬ 
tremes. If one thinks of bahies growing up in slums with crack-addicted 
mothers, at one extreme, compared to children growing up in affluent, 
eulturally rieh homes with parents dedicated to squeezing every last 
IQ point out of them, then even a heritability of .6 leaves room for 
considerable change if the changes in environment are commetv 
surably large. We take up the evidence on that issue in detail in 
Chaprer 17, when we consider the many educational and social In¬ 
tervent ions that have attempted to raise IQ. But rhose are, hy defini- 
rion, the extremes, the two tails of the distribution of environments. 
Moving a child from an environment that is the \-ery worst to the very 
best may make a big difference. In reality, what most interventions 
accomplish is to rnove children from avvful environments to oncs that 
are merely below average, and such changes are limited in their poten¬ 
tial consequences when heritability so constrains the limits of environ¬ 
mental effects. 1411 

So while we can lock forward to a future in which Science discovers 
how to foster intelligence environmentally and how to use the science 
humanely, inherited cognitive ability is now extremely important. In 
this sense, luck conrinues to matter in life’s outcomes, but now it is more 
a matter of the IQ handed out in life’s lottery than anything eise about 
circumstanees. High cognitive ability as of the 1990s means, more than 
even hefore, that the chances of success in life are good and gerting bet- 
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ter all the time, and these are decreasingly affected by the social envi- 
ronment, which by extension indicates that they must be increasin^iy af¬ 
fected hy genes. Holding these thoughts in mind, now consider the 
phenomenon known as assortative mating. 

Love , Marriage, and IQ 

The old saw notwithstanding, opposites do not really attract when it 
comes to love and marriage. Likes attract. ln one of the classic papers, 
originally published in 1943, rwo sociologists studied 1,000 engaged 
couples in Chicago, expecting to find at least some traits in which op¬ 
posites did indeed attract. But out of fifty-one social characteristics stud¬ 
ied, the sign of the correlation was positive for every single one. For all 
but six of the fifty-one traits, the correlations were staristically signifi- 
cant. k Modest but consistently positive correlations have been found 
for a wide variety of physical traits as well, ranging from stature (the cor- 
relations from many studies average about +.25) to eye color (also av- 
eraging about +.25, even within national populations). 41 

Of the many correlations involving hushands and wives, one of the 
highest is for IQ. In most of the major studies, the correlation of hus- 
band and wife IQ has been in the region of .4, though estimates as low 
as .2 and as high as .6 have been observed. Jensen’s review of the liter¬ 
al ure in the late 1970s found that the average correlation of forty-three 
spouse correlations for various testsof cognitive ahility was + .45, almost 
as high as the typical correlation of IQs among sihlings. 1441 

If thc Propensity to Mate by Cognitive Ahility Has Remained the Same: 

When the propensity to mate by cognitive ahility is combined with the 
educational and occupational stratification we have described, the irn- 
pact on the next generation will be larger than on the previous one, even 
if the underlying propensity to mate by cognitive ability remains the same. 

Consider 100 Harvard/Radcliffe marriages from the dass of 1930 ver¬ 
sus another 100 from the dass of 1964. We stipulate that the propen¬ 
sity to marry people of similar intelligence has not changed in the 
intervening thirty-four years. Nonetheless, the ones who marry in 1964 
will produce a set of children with considerably higher me an IQ than 
the ones who married in 1930, because the level of intelligence at Har¬ 
vard and Radcliffe had risen so dramatically. 

How much difference can it make? If the average Harvard man in the 


dass of 1930 married the average Radcliffe woman in the same gradu- 
ating dass—as far as we can teil, hoth would have had IQs of about 
117—then the expected mean IQ of their children, after taking regres- 
sion to the mean into account, will be about 114, or at the 82d per- 
centile. 14 ^ 1 But average Harvard and Radcliffe newlyweds in the dass of 

1964 were likely to have children with a mean IQ of about 124, at the 
95th percentile. ln terms of distributions rather than averages, about a 
third of the children of the Harvard newlyweds of 1930 could be ex¬ 
pected to have IQs of less than 110—not even College material by some 
definitions.* 461 ln contrast, only 6 percent of the children of the Harvard 
newlyweds of 1965 could be expected to fall below this cutoft. Mean- 
while, only about 22 percent of the children of the 1950 newlyweds 
could be expected to match or exceed the average of the children of the 

1965 newlyweds. ln such numbcrs lurk large social effects. 

If the Propensity to Mate by Cognitive Ahility Has Increased: 

We have been assuming that the propensity to mate by IQ has remained 
the same. In reality, it has almost certainly increased and will continue 
to increase. 

We hedge with “almost” because no quantitative studies teil wherher 
assortative mating by intelligence has been increasing recently. But we 
do know from sociologist Robert Mare of the University of Wisconsin 
that assortative mating by educational level increased over the period 
from 1940 to 1987—an increase in “homogamy,” in the sociologists’ lan- 
guage. The increase in homogamy was most pronounced among College- 
educated persons. Specifically, the odds of a College graduate’s marrying 
someone who was not a College graduate declined from 44 percent in 
1940 to 35 percent in Mares most recent data (for 1985 to 1987). The 
Proportion hit a low of 33 percent in the 1980 data. i4;! Because educa¬ 
tional attainmenr and IQ are so closely linked and became more c lose ly 
linked in the postwar period, Mares resultssuggest a suhstantial increase 
in assortative mating hy IQ, with the greatest change occurring at the 
upper levels of IQ. 

Mare identifies some of the reasons for increased homogamy in the 
trends involving educational attainment, age at leaving school, and age 
at marriage. But there are a variety of other potential explanations 
(some of which he notes) that involve cognitive ahility specifically, For 
example, a smart wife in the 1990s has a much greater dollar payoff for 
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a man than she did fifty years ago. 4h The feminist movement has also 
increased the likelihood of marrying by cognitive ahility. 

First, the feminist revolution in practice (which began in the 1950s, 
antedating the revolution in rhetoric) drastically increased the odds 
that bright young women will be thrown in contact vvith bright young 
men during the years when people choose spouses. This is most ohvious 
in College, where the proportion of women continuing to College surged 
from about half the proportion of men in 1950 to equality in 1975A 1t 
was not just the numbers, however. All of the elite men's Colleges be- 
came coeducational, as did many of elite women’s Colleges. Strict pari¬ 
etal rules gave way to coeducational dorms. Intelligence has always been 
an important factor for sorting among prospective mates, but compari- 
son shopping at single-sex Colleges like Vassar or Yale was a struggle; the 
feminist revolution in the universities led to an explosion of intorma- 
tion, as it were, that made it easier for the brightest to pair up. 

The same phenomenon extended to the workplace. Large propor- 
tions of the cognitive elite delay marriage until the later twenties or 
even thirties. Only a fevv decades ago, delay tended to dilute the chances 
of assortative mating by IQ. In a world where the brightest women were 
usually not in the work force or were in a few resrricted occupations, the 
pool from which a man in bis late twenties found a hride were moder- 
ated primarily by socioeconomic Status; he found bis mate among the 
women he encountered in his neighhorhood, church, social organi:a- 
tions, and other settings that were rnatched mostly by scx;ioeconomic 
Status. But today background Status is less important than intelligence. 
The young man newly graduated from his elite law school joins his elite 
New York firm, thereupon encountering young women, just as high ly 
selected for cognitive ahility as he was, in the adjacent offices at his own 
firm, at business lunches, across the table in negotiations, on a daily ha- 
sis. The opportunities for propinquity to work its magic were increased 
in the workplace too, and wäll continue to increase in the years to come. 

The second effect of feminism is less ponderable but may be impor¬ 
tant anyway. Not so many years ago, the cliche was true: brains were not 
considered sexy in a woman, and many men undervalued brains as an 
asset in a prospective spouse or even feit threatened by smart women. 
Such attitudes may linger in some men, but feminism has surely weak- 
ened them and, to some degree, freed relationships among men and 
women so that a woman's potential for occupational success can take as 


dominant a place in the man’s marriage calculus as it has traditionally 
taken in the woman’s. 1 ' 01 We speculate that the effect has been most lib- 
erating among the brightest. If we are right, then the trends in educa- 
tional homogamy that Mare has demonstrated are an understated 
reflection of what is really going on. Intermarriage among people in the 
top few percentiles of intelligence may be increasing far more rapidly 
than suspected. 

THE LIMITS OF CHURNING 

American society has historically been full of churning, as new groups 
came to this country, w r orked their way up, and joined the ranks of the 
rieh and povverful. Meanwhile, some of the children of the rieh and pow- 
erful, or their grandchildren, were descending the ladder. This process 
has made for a vibrant, selbrenewing society. In depressing contrast, we 
have been envisioning a society that hecomes increasingly quiescent at 
the top, as a cognitive elite moves toward the upper ineome brackets 
and runs most of the institutions of society, taking on some of the char- 
acteristics ol a easte. 

Is the Situation really so extreme? To some exrent, not yet. For ex- 
ample, national surveys still indicate that fewer than 60 pereent in the 
topquartile of intelligenceactually complete a bachelor’sdegree. |M| This 
would seem to leave a lot of room for churning. But when we focus in- 
stead on the students in the top few centiles of cognitive ahility (from 
which the nation’s elite Colleges pick almost exclusively), an exrremely 
high proportion are already being swept into the comfortable precincts 
of the cognitive elite. v In the NLSY, for example, 81 pereent of those 
in the top 5 pereent of IQ had obtained at least a bachelor s degree by 
1990, when the youngest members of the sample were 25 years oldA 

When we examine the remaining 19 pereent who had not obtained 
College degrees, the efficiency of American society in pushing the most 
talented to the top looks even more impressive. For example, only a 
small portion of that 19 pereent were smart students who had been raised 
in a low-income family and did not get to College for lack of opportu- 
nity. Only 6 pereent of persons in the top five IQ centiles did not have 
a College degree and came from families in the lower halt of socioeco- 
nomic Status. ,4 

If this 19 pereent of higfvIQ persons-without-B.A.s Joes not fit the 
stereotype of the deprived Student, who were they? Some were he- 
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comin^ members of rhe cognitive elite even though they do not have 
a College degree. Bill Gates, College dropout and founder of Microsoft, 
is the larger-than-life prototype. Five percentage points of the 19 per- 
cent were working in one of the high-IQ occupations, indicating that 
they were probably of the minor-league Bill Gates variety (corrobo- 
rated by their incomes, which were high). Of the remaining 14 per- 
cent who were not working in high-IQ occupations, a quarter had 
family incomes in excess of $50,000 while they were still only in their 
late twenties and early thindes, putting them in the top 20 percent of 
family incomes for their age groupT In total, roughly half of these smart 
non-college graduates are already taking their place among the smart 
College graduates, by virtue of their incomes, their occupations, or both. 
It seems a safe bet that the neighborhoods where they live and the way 
they socialize their children are going to he indistinguishable from 
those of most of their counterparts in the top five cent des who com' 
pleted College. 

There is douhtless some relatively small fraction of those in the rop 
5 percent intellectually who will never rise ro successful positions, 
whether because of lack of motivation or objective barriers. Bur w’hat a 
small percentage of the highly talented they are. And we may add a re- 
minder that we are vvatching an ongoing process. Think back to Chap- 
rer 1 and imagine the trend line from 1900 to 1990 stretched out to, 
say, 2020. Whatever the nurnber of the cognitive elite who slip hetween 
the cracks now, it is a much smaller figure than it was in rhe 1950s, rad- 
ically smaller than it was in the 1900s, and presumably it will get smaller 
still in the future. 

These observations have several implications. At a practical policy 
level, the most ohvious is that programs to expand opportunity for the 
disadvantaged are not going to make much difference in getting the 
most talented youths to College. An extremely high proportion of those 
who want to go are already going. The broader implication is that the 
funneling System is already functioning at a high level of efficiency, 
therehy promoting three interlocking phenomena: 

1. The cognitive elite is getting richer, in an era when everybody eise 

is having to struggle to stay even. 

2. The cognitive elite is increasingly segregated physically from every- 

one eise, in both the workplace and the neighhorhood. 

3. The cognitive elite is increasingly likely to intermarry. 


These phenomena are driven hy forces that do not lend themselves 
ro easy reconfiguration hy politicians. As we leave Part 1, here is a topic 
to keep in the back of your mind: What if the cognitive elite were to 
become not only richer than everyone eise, increasingly segregated, and 
more genetically distinct as time goes on but were also to acquire com¬ 
mon political interests? What might those interests be, and how con- 
gruent might they he with a free society? How decisively could the 
cognitive elite affect policy if it were to acquire such a common politi¬ 
cal mterest? 

These issues will re turn in the last chapters in the book. They are 
postponed for now, because we must first explore the social problems 
that might help create such a new political coalition. 





PART II 


Cognitive Classes and 
Social Behavior 


Whereas Part 1 dealt with positive outcomes—attainment of high edu- 
cational levels, prestigious occupations, high incomes—Part II presents 
our best estimate of how much intelligence has to do with Americas 
most pressing social problems. The short answer is U quite a lot,” and the 
reason is that different levels of cognitive ability are associated with dif- 
ferent patterns of social behavior. High cognitive ability is generally as¬ 
sociated with socially Jesirable behaviors, low cognitive ability with 
socially undesirable ones. 

“Generally associated with“ does not mean “coincident with.” For 
virtually all of the topics we will he discussing, cognitive ability aceounts 
for only small to middling proportions of the Variation among people. 
It almost always explains less than 20 percent of the variance, to use the 
statistician’s term, usually less than 10 percent and uften less than 5 per¬ 
cent. What this means in English is that you cannot predict what a given 
person will do from bis IQ score—a point that we have made in Part 1 
and will make again, for it nceds repeating. Ön the other hand, despite 
the low association at the individual level, large differences in social he- 
havior separate groups of people when the groups differ intellectually 
on the average. 

We will argue that intelligence itself, not just its correlation with 
socioeconornic Status, is responsible for these group differences. Our 
thesis appears to he radical, judging from its neglect by other social 
scientists. Could low intelligence possibly be a cause of irresponsible 
childbearing and parenting behaviors, for example? Scholars of child- 
hearing and parenting do not seem to think so. The 850 double-column 
pages of the authoritative Handhook of Marriage and the Family, for 
example, allude to intelligence about half a dozen times, always in 
passing. 1 Could low intelligence possibly he a cause of unemployment 
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or poverty? Only a Scattering of economists have broached the pos- 
sibility. 2 

This neglect points to a gaping hole in the state of knowledge about 
social behavior. 1t is not that cognitive ability has been considered and 
found inconsequential but that it has barely been considered at all. The 
chapters in Part II add cognitive ability to the mix of variables that so¬ 
cial scientists have traditionally used, Clearing away some of the mys- 
tery that has surrounded the nation’s most serious social problems. 

We will also argue that cognitive ability is an important factor in 
thinking about the nature of the present problems, whether or not cog¬ 
nitive ability is a cause. For example, if many of the single women who 
have babies also have low IQ, it makes no difference (in one sense) 
whether the low IQ caused them ro have the babies or whether the path 
of causation takes a more winding route. The reality that less intelligent 
women have most of the out-of-wedlock babies affects and constrains 
public policy, whatever the path of causation. The simple correlation, 
unadjusted for other factors—what social scientists called the zero-or- 
der correlation—between cognitive ability and social hehaviors is so- 
cially important. 

The chapters of Part II cover a wide ränge of topics, each requiring 
extensive documentation. Many statistics, many tables and graphs, 
many citations to technical joumals crowd the pages. But the chapters 
generally follow a similar pattem, and many of the complexiries will 
be less daunting if you understand three basics: the NLSY, our use of 
cognitive classes, and our Standard operating procedure for Statistical 
analysis. 

THE NLSY 

In Part I, we occasionally made use of the National Longitudinal Sur- 
vey of Youth, the NLSY. In the chapters that follow, it will play the cen¬ 
tral role in the analysis, with other studies called in as avaitable and 
appropriate. 

Until a few years ago, there were no answers to many of the quesrions 
we will ask, or only very murky answers. No one knew what the rela- 
tionship of cognitive ability to illegitimacy might be, or even the rela- 
tionship of cognitive ability to poverty. Despite the millions of mental 
tests that have been given, very few of the systematic surveys, and some- 
times none, gave the analyst a way to conclude with any conf idence that 


this is how IQ interacts with behavior X for a representative sample of 
Americans. 

Several modern sources of data have begun to answer such quesrions. 
The TALENT database, the huge national sample of high school stu- 
dents taken in 1961, is the most venerable of the sources, but its follow - 
up surveys have been limited in the ränge and conrinuity of their data. 
The Panel Study of Income Dynamics, begun in 1968 and the nation’s 
longest-running longitudinal database, administered a briet vocabulary 
test in 1972 to part of its sample, but the scores allow only rough dis- 
criminations among people in the lower portions of the distribution of 
intelligence. The National Longitudinal Survey begun by the Depart¬ 
ment of Education in 1972 (not to be confused with the NLSY) pro- 
vides answers to many questions associated with educational outcomes. 
The department’s more ambitious study, High School and Beyond, con- 
ducted in the early 1980s, is also useful. 

But the mother lode for scholars who wish to understand the rela- 
tionship of cognitive ability to social and economic outcomes is the 
NLSY, whose official name Ls the National Longitudinal Survey of La¬ 
bor Market Experience of Youth. When the study began in 1979, the 
participants in the study were aged 14 to 22. 111 There were originally 
12,686 of them, chosen to provide adequate sample sizes for analyzing 
crucial groups (for example, by Oversampling blacks, Latinos, and low- 
income whites), and also incorporating a weighting System so that 
analysts could determine the correct estimates for nationally represen¬ 
tative sample» of their age group. Sample attrition has been kept low 
and the quality of the data, gathered by the National Opinion Research 
Council under the supervision of the Center for Human Resources Re¬ 
search at Ohio State University, has been excellent. 

The NLSY is unique because it combines in one database all the eh 
ements that hitherto had to be studied piecemeal. Only the NLSY com- 
bined detailed Information on the childhood environment and parental 
socioeconomic Status and subsequent educational and occupational 
achievement and work history and family formation arid —erucially for 
our inrerests—derailed psychometric measures of cognitive skills. 

The NLSY acquired its cognitive measures by a lucky coincidence. 
In 1980, a year after the first wave of data colleetion, the Department 
of Defense decided to update the national norms for its bat tery' of en- 
listment tests. At the time, it was still using test scores from World War 
11 recruits as the reference population. Because the NLSY had just gone 
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through che technically difficult and tedious cask ot selecting a natioiv 
ally representative sample, the Department ot Defense proposed ro 
piggyhack its study on theNLSY sample. 4 And so the NLSY beeame the 
beneficiary of an expensive, welhdesigned set ot cognitive and aptitude 
tests that weve given under caretully controlled conditions to almost 94 
percent ot the 12,686 young men and women in the NLSY sample. 

The measure of cognitive ahility extracted froin this test battery was 
the Armed Forces Qualitication Test, the AFQT. 1t is what the psycho- 
metricians call “highly g-loaded,” meaning that it is a good measure ot 
general cognitive ability. 6 The AFQT s most significant shortcoming is 
that it is truncated at the high end; about one person in a rhousand gets 
a perfect score, which means both that the test does not discriminate 
aniong the very highest levels of intelligence and that the variance in 
the population is somewhat understated. Otherwise the AFQT is an ex- 
cellenr test, with psychometric reliability and validity that compare well 
with those of the other major tests of intelligence. Because the raw 
scores on the AFQT mean nothing to the average reader, we express 
them in the IQ metric (with a mean of 100 and a Standard deviation ot 
1 5) or in centiles. Also, we will subsequently refer to them as “IQ scores,“ 
in keeping with our policy of using IQ as a generic term for intelligence 
test scores. When we use centiles, they are age equated. A centile score 
ot 45, for example, means that the subject would rank in the 45rh per- 
centile of everyone born in the same year, if everyone took the AFQT. 1,1 
A final point about the presentation of NLSY results is that all results 
are based on i veighted arialyses , which means that all may he interpreted 
in terms of a nationally representative sample ot Americans in theNLSY 
age group. We use data collected through the 1990 interview' wave. 

THE DEFINITION OF COGNITIVE CLASSES 

To this point, we have heen referring to cognitive classes without heing 
specific, ln these chapters, we divide the world into cognitive classes— 
five of them, because that has been the most common number among 
sociologisrs who have broken down socioeconomic Status into classes 
and because five allows the natural groupings of “very high,“ “high,” 
“mid,” “low,” and “very low.” We have chosen to break the intervals at 
the 5th, 25th, 75th, and 95th percentiles of the distribution. The figure 
shows how this looks for a normally distributed population. 

Break points are arbitrary, but we did have some reasons for these. 


Defining the cognitive classes 


The Distribution of IQ 



Mainlv, we wanred to focus on rhe extremes; hence, we avoided a sim¬ 
ple hreakdown intoquintiles (i.e., into equal cuts of 20 percent). A great 
deal of interest goes on within the top 20 percent and hotrom 20 pei- 
eenr of the population. lndeed, if the sample sizes were large enough, 
we would have defined the top cognitive dass as consisting of the top 
1 or 2 percent of rhe population. Important gradations in social behav- 
jor occasionally separate the top 2 percent from the next 2 percent. This 
is in line with another of rhe ehernes that we keep reiterating because 
they are so easily forgotren: You—meaning the self-selected person who 
has read this tat into this book—live in a world that probahly looks noth¬ 
ing like the figure. In all likelihood, almost all of your friends and Pro¬ 
fessional associates helong in that top Class 1 slice. Your friends and 
associates whom you consider to be unusuaüy slow are probahly some- 
where in Class II. Those whom you consider to he unusuaüy hright are 
probahly somewhere in the tippet fraction of the 99th centile, a very 
thin slice of the overall distribution. In defining Class 1, which we will 
use as an operational defmition of the more amurphous group called the 
“cognitive elite,” as heing the top 5 percent, we are heing quite inclu¬ 
sive. It does, after all, embrace some 12 1/2 million people. Class Ul, the 
normals, comprises half of the population. Classes 11 and IV each com- 
prises 20 percent, and Class V, like Class I, comprises 5 percent. 
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The labeis for the classes are the best we could do. It is impossible to 
devise neutral terms for people in the lowesr classes or rhe highest ones. 
Our choice of “very dull” for Class V sounds to us less damning than the 
Standard “retarded” (which is generally defined as he low an IQ of 70, 
with “borderline retarded” referring to IQs between 70 and 80). “Very 
bright” seems more focused than “superior,” which is the Standard term 
for people with IQs of 120 to 130 (those with IQs above 1 30 are called 
“very superior” in that nomenclature). |8i 

PRESENTING STATISTICAL RESULTS 

The basic tool for multivariate analysis in the social Sciences is known 
as regression analysis. 1 ^ The many forms of regression analysis have a 
common structure. There is a result to explain, the dependent variable . 
There are some things that might be the causes, the independent varP 
ables. Regression analysis teils how much each cause actually af+ects the 
result, taking the role of all the other hypothesized causes into account —an 
enormously useful thing for a Statistical procedure to do, hence its wide- 
spread use. 

In most of the chapters of Part II, we will be looking at a variety of 
social behaviors, ranging from crime to childbearing to unemployment 
tt") citizenship. In each instance, we will look first at the direct rela- 
tionship of cognitive ability to that behavior. After observing a staris- 
tical connection, the next question to come to mind is, What eise might 
be another source of the relationship? 

In the case of IQ, the obvious ans wer is socioeconomic Status. To what 


What Is a Variable? 


The word variable confuses some people who are new to statistics, because 
it sounds as if a variable is something that keeps changing. In facr, it is 
something that has different values arnong the members of a population. 
Consider weight as a variable. For any given observation, weight is a sin- 
gle numher: the number of pounds that an object weighed at the time the 
observation was taken. But over all the members of the sample, weight has 
different values: It varies, hence it is a variable. A mnemonic for keeping 
“independent” and “dependent“ straight is that the dependent variable is 
thought to depend on the values of che independent variables. 


extent is this relationship really founded on the social baekground and 
economic resources that shaped the environment in which the person 
grew up—the parents’ socioeconomic Status (SES)—rather than intel- 
ligence? Our measure of SES is an index combining indicators of 
parental educarion, income, and occupational prestige (details may be 
found in Appendix 2). Our basic procedure has been to run regression 
analyses in which the independent variables include IQ and parental 
SES. 1101 The result is a Statement of the form: “Here is the relationship 
of IQ to social behavior X after the effects of socioeconomic baekground 
have heen extracted,” or vice versa. Usually this takes the analysis most 
of the distance it can sensibly be pushed. If the independent relation¬ 
ship of IQ to social behavior X is small, there is no point in looking fur- 
rher. If the role of IQ remains large independent of SES, then it is worth 
thinking about, for it may cast social behavior and public policy in a 
new light. 

But. What About Other Explanations? 

We do not have the choice of leaving the issue of causation at that, how- 
ever. Because inrelligence has heen such a taboo explanation for social 
behavior, we assume that our conclusions will often be resisted, if not 
condemned. We can already hear crirics saying, “If only they had added 
this other variable to the analysis, they would have seen that intelU- 
genee has nothing to do with X.” A major part of our analysis accord- 
ingly has been to anticipate what other variables might be invoked and 
seeing if they do in fact attenuate the relationship of IQ to any given 
social behavior. This was not a scattershot efforr. For each relationship, 
we asked ourselves if evidence, theory, or common sense suggests an- 
other major causa! story. Sometimes it did. When looking at whether a 
new mother went on welfare, for example, it clearly was not enough to 
know the general socioeconomic baekground of the woman's parents. 
It was also essential to examine her own economic Situation at the time 
she had the baby: Whatever her IQ is, would she go on welfare if she 
had economic resources to draw on. ? 

At this point, however, Statistical analysis can hecorne a bottomless 
pit. It is not uneommon in technical joumals to read articles hui It 
around the estimated effects of a dozen or more independent variables. 
Sometimes the entire set of variables is loaded into a single regression 
equation. Sometimes sets of equations are used—modeling even more 
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complex relationships, in which all the variables can exerr mutual ef- 
tects on one another. 

Why should we not press forward: Why not also ask if religious Back¬ 
ground has an ettect on the Jecision to go on welfare, for example! 1t is 
an interesting question, as are another fifry others rhat might come to 
mind. Our principle was to explore additional dynamics when there was 
another tactor (hat was not only conceivahly important hut for clear 
logical reasons might he important because of dynamics having little or 
not/iing to du xvith IQ. T s last proviso is crucial, for one of the most 
common rnisuses ol regression analysis is to inrroduce an additional 
variable that in reality is mostly another expression of variables that are 
already in the equation. 

The Special Case of Education 

Education posed a special and continuing probleni. On the one hand, 
education can he important independent of cognitive ahility. For ex¬ 
ample, education tends to delay rnarriage and childbirth hecause the 
time and commitmenr involved in heing in school competes with the 
time and commitmenr it takes to he married or have a baby. Education 
shapes rastes and values in vvays rhat are independent of the cognitive 
ahility of the Student. Ar the sarne time, however, the role of education 
versus IQ as calculated hy a regression equation is t ricky to interpret, for 
tour reasons. 

First, the numbor ot years of education that a youth gets is caused to 
an important degree hy horh the parents’ SES and the youth’s own aca- 
demic ahility. ln the NLSY, tor example, the correlarion of years of ed¬ 
ucation with parental SES and youth’s IQ are +.50 and +.64, 
respectively. This means that when years oi education is used as an in¬ 
dependent variable, it is to some extern expressing the effects of SES 
and IQ in another form. 

Second, any role that education plays independent of intelligence is 
likely to be discontinuous. For example, it may make a big difference to 
many outcomes rhat a person has a College degree. Rut how is one to in¬ 
terpret the substantive difference hetween one year of College and two? 
Retween one year of graduate school and two? They are unlikely to he 
nearly as important as the difference hetween u a College degree” and “no 
College degree.” 

Fhird, variables that are closely related can in some circumstances 
produce a technical Problem known as muhicoüinearity, whereby the So¬ 


lutions produced by regression equations are unstable and ölten mis- 
leading. 

Foiirrh and final ly, to rake eduoation’s regression coefficient seriously 
tacitly assumes that intelligence and education could vary indepen- 
dently and produce similar results. No one can believe this to be tme in 
general: indisputably, giving nineteen years of education to a person 
with an IQ of 75 is not going to have the samt impact on life as it would 
for a person with an IQ of 125. The effects of education, whatever they 
may he, depend on the coexistenee of suitahle cognitive ahility in ways 
that often require complex and extensive modeling of interaction ef¬ 
fects—once again, prohlems that we hope others will take up hut would 
push us iar heyond the purposes oi this hook. 

Our solution to this Situation is to report the role of cognitive abil- 
ity for two subpopulations of the NLSY rhat each have the same level 
ol education: a high school diploma, no more and no less in one group; 
a hachelor’s degree, no more and no less, in the orher. This is a simple, 
hut we believe reasonable, way of hounding the degree to which cogni¬ 
tive ahility makes a difference independent of education. 

We walk through all three of these basics—the NLSY, the five cogni¬ 
tive c lasses, and the forma r for the Statistical analysis—in a Step-hy-Step 
tashion in the next chapter, where we use poverty to set the stage for 
the social behaviors ro follow. Chapter 6 returns to education, this time 
not just talking about how tar people got hut the comparative roles of 
IQ and SES in determining how tar someone gets in school. Then, se¬ 
rial im, we take up unemployment and lahor force dropout (Chapter 7), 
single-parent families and illegitimacy (Chapter 8), welfare dependency 
(Chapter 9), parenting (Chapter 10), crime (Chapter 1 1), and civic he- 
havior (Chapter 12). 

ln these eight chapters, we limit the analysis to whites, and more 
speetfieally to non-Latino whites. 11 " This is, we think, the best way to 
make yet another central point: Cognitive ahility aflects social behav- 
ior without regard to race or ethnicity. The influence of race and eth- 
nicity is deferred to Part 111. 







Chapter 5 

Poverty 


Who becomes poor. ? One familiär answer is that people who are unlucky 
enough to be born to paar parents become poor. There is some truth to this. 
Whnes , the focus of our analyses in the chapters of Part II, who grew up in 
the warst 5 percent of socioecommiic circumstances are eight ümes more likely 
to fall below the poverty line than those gmwing up in the top 5 percent of so- 
cioeconomic circumstances. But low intelligence is a strenger precursor of 
poverty than low socioeconomic background. Whites with IQs in the bottom 
5 percent of the distribution of cognitive ability are fifteen times more likely to 
be poor than those with IQs in the top 5 percent. 

How does each of these causes of poverty look when the other is hehl con- 
stant. ? Gr to put it another way: If you have to choose, is it beiter to be born 
smart or rieh! The answer is unequivocally “smart.” A white youth reared in 
a home in which the parent or parents were chronically unemployed, worked 
at only the mos t menial of Jobs, and had not gatten post ninth grade, but of 
just average intelligence—an IQ of 100—has nearly a 90 percent chance of 
being out of poverty by his or her early 30s. Conversely, a white youth born 
to a solid middle-class family but with an IQ equivalently below average faccs 
a much higher risk of poverty, despite his more fortunate background. 

When the picture is complicated by adding the effects of sex, marital Sta¬ 
tus, and years of educatum , intelligence remains more important than any of 
them, with marital Status running a dose second. Among people who are both 
smart and well educated, the risk of poverty approaches zero. But it should 
also be noted that young white adults who rnarry are seldom in poverty, even 
if they are below average in intelligence or education. Even in these more com¬ 
plicated analyses, low IQ continues to be a much scronger precurstrr of poverty 
than the socioeconomic circumstances in which people grow up. 


W e begin with poverty hecause it has been so much ar the center 
of concern about social problems. We will he asking, “What 
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causes poverty?” focusing on che role that cognirive ahility mighr play. 
Our point of departure is a quick look ac che history of poverty in ehe 
next figure, which scholars from ehe Institute for Research on Poverty 
have now enabled us to take back to the 1930s. 111 

Dramatic progress against poverty from World War II 
through the 1960s, Stagnation since then 

Proportion of Americans below the poveity line 



i i i-1-1-1— 

1940 1950 1960 1970 1980 1990 


Source.s: SAUS, various eüitions; Ross and others, 1987. 


In 1939, over half of the people of the United States lived in fami- 
lies WLth an income below the amount thar constitutes the present 
poverty line—in constant dollars, of course, This figure declined steeply 
through World War II, and then through the Truman, Eisenhower, 
Kennedy, and Johnson administrations. Then came a sudden and last' 
ing halt to progress. As of 1992, 14-5 percent of Americans were below 
the poverty line, within a few percentage points of the level in 1969. 
This history provokes three observations. 

The first is that poverty cannot he a simple, direct cause of such prob- 
lems as crime, illegitimacy, and drug abuse. Probably no single ohserva- 
tion about poverty is at once so indisputable and so ignored. It is 
indisputable because poverty was endemic at a time when those prob' 
lems were minor. We know that reducing poverty cannot, by itself, be 
expected to produce less criminality, illegitimacy, drug abuse, or the rest 


of the catalog of social prohlems, eise the history of the twentieth Cen¬ 
tury would have chronicled their steep decline. 

The second point illusrrat ed by the graph of poverty is that the pool 
of poor people must have changed over time. As late as the 1940s, so 
many people were poor in economic terms that to be poor did not nec- 
essarily menn to be distinguishable from the rest of the population in 
any orher way. To rephrase the dialogue between E Scott Fitzgerald and 
Ernest Hemingway, the poor were different from you and me: They had 
less money. Rur that was almost the only reliable difference. As affin- 
ence spread, people who escaped from poverty were not a random sam¬ 
ple of the population. When a group shrinks from over 50 percent of 
the population to the less than 15 percent that has prevailed since the 
late 1960s, the people who are left behind are likely to be dispropor- 
tionately rhose who suffer not only bad luck but also a lack of energy, 
thrift, farsightedness, determination—and brains. 

The tbird point of the graph is that some perspective is in order about 
vvhat happened ro poverty during the 1960s and the famous War on 
Poverty. The rrendline we show for ] 936-1969 would have had abour the 
same slope if we had chosen any of the decades in between to calculate it. 
The United States was not only getting richer but had been reducing the 
percentage of people below the modern poverty line for at least three 
decades hefore the 1960s came to a close. We will not reopen here the 
enntinuing debate abour why progress came to an end when it did. 

In this chapter, we explore some basic Undings about the different 
roles that intelligence and social hackground play in keeping individu- 
als out of poverty. The basics may be stated in a lew paragraphs, as we 
did in the chapters introduction. But we also want to speak to readers 
who ask, “Yes, but what about the role of. . . thinking of the many 
other potential causes of white poverty. By the end of the chapter, we 
will have drawn a controversial conclusion. How did we get there ? What 
makes us think that we have got our causal ordering right? We will walk 
through the analyses that lie behind our conclusions, taking a more 
leisurely approach than in the chapters to come. 

CAN AN IQ SCORE TAKEN AT AGE 15 BE A CAUSE OF 
POVERTY AT AGE 30? 

We need to deal at once with an issue that applies to most of the top- 
ics in Part II. We want to consider poverty as an effect rather than as a 
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cause—in social Science terminology, as a dependent, not an indepen¬ 
dent, variable/ Intelligence will be evaluated as a factor that bears on 
becoming poor. But what, after all, does an intelligence test score mean 
for an adolescent who bas grown up poor? Wouldn’t his test: score have 
been higher if his luck in home environment had been better? Can IQ 
be causing poverty if poverty is caiising IQ? 


The Stability of IQ over the Life Span 


The stability ot IQ over time in the general population has been studied 
for decades, and the main findings are not in much dispure among psy- 
chometricians. Up to about 4 or 5 years of age, measures of IQ are not of 
much use in predicting later IQ. Indeed, you will get a better prediction of 
the child’s IQ at age 15 by knowing his parenrs’ IQ than hy any test of the 
child given before age 5. 1 Between ages 5 and 10, the tests rapidly become 
more predictive of adult IQ. 141 After about the age of 10, the IQ score is es- 
sentially stähle within the constraints of measurement errorU* (On the cum- 
paratively rare occasions when large changes in IQ are observed, there is 
usually an obvious explanation. The child had been bedridden with a long 
i[Iness before one of the tests, for example, or there was severe emotional 
disturbance at the time of one or both ot the tests. 


The IQ score of an individual might have been higher if he had been 
raised in more fortunate circumstances. Chapter 17 discusses this issue 
in more detail. But for purposes of Part II, the question is not what might 
have been but what is. ln discussions of intelligence, people ohsess about 
nature versus nurture, thinking that it matters fundamentally whether 
a person with a low IQ at, say, age 15 came by that IQ through a defi- 
cient environment or by bad luck in the genetic draw. But it does not 
matter for the kinds of issues we consider in Part II. The AFQT test 
scores for the NLSY sample were obtained when the subjects were 15 
to 23 years of age, and their IQ scores were already as deeply rooted a 
fact about them as their height. 161 

SOCIOECONOMIC BACKGROUND VERSUS COGNITIVE 
ABILITY 

For a Century after poverty became a topic of systematic analysis in the 
mid- 1800s, it was taken for granted that there were different kinds of 


poor people, with “deserving” and “undeserving” being one of the pri- 
mary divisions. 7 Some people were poor because of circumstances be- 
yond their control; others were poor as a result of their own behavior. 
Such distinctions among types of poverty were still intellectually re- 
spectable into the beginning of the Kennedy administration in 1961. 
By the end of the 1960s, they were not. Poverty was now seen as a prod- 
uct of broad systemic causes, not of individual characteristics. To say 
otherwise was to “blame the victim.” 8 Accordingly, the technical Ut- 
erature about the causes of current poverty deals almost exclusively in 
economic and social explanations rather than with individual charac¬ 
teristics. Much of this literature focuses on poverty among blacks and 
its roots in racism and does not apply to the topic at hand: poverty 
among whites. 

1t. seems easy to make the case that poverty among whites also arises 
from social and economic causes. Using the NLSY, we convert Infor¬ 
mation about the education, occupations, and income of the parents of 
the NLSY youths into an index of socioeconomic Status (SES) in which 
the highest scores indicate advanced education, affluence, and presti- 
gious occupations. The lowest scores indicate poverty, meager educa¬ 
tion, and the most menial jobs. Suppose we then take the SES index 
and divide all the NLSY youngsters into five socioeconomic classes on 
exactly the same basis that we defined cognitive classes (split into cat¬ 
egories of 5-20-50-20-5 percent of rhe population). We then ask, 
What percentage of people who came from those socioeconomic hack- 
grounds were below the poverty line in their late 20s and early 30s (i.e., 
in 1989)? We exclude those who were still in school. The answer for 
non-Latino whites in the NLSY sample is shown in the following table. 
What could be plainer? Hardly any of the lucky 5 percent who had 
grown up in the most advantaged circumstances were in poverty (only 


White Poverty by Parents’ Socioeconomic Class 

Parents * 

Socioeconomic Class 

Percentage in Poverty 

Very high 

3 

High 

3 

Mid 

7 

Low 

12 

Very low 

24 

Overall average 

7 
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3 percent). Meanwhile, the white children of parents in the lowest so¬ 
cioeconomic dass had a poverty rate of 24 percent. Rank hath its Priv¬ 
ileges, and in the United States one of those Privileges is to confer 
economic benefits on your children. The way to avoid poverty in the 
United States is to he born into an advantaged home. 

Now we switch lenses. Instead of using socioeconomic dass, we now 
ask, What percentage of the people who are in the different cognitive 
classes were below the poverty line in 1989? The answer is in the next 
table. There are similarities at the top of the ladder. Those in the top 


White Poverty by Cognitive Class 

Cognitive Class 

Percentage in Poverty 

I Very brighr 

2 

11 Bright 

3 

III Normal 

6 

IV Du 11 

16 

V Very dull 

30 

Overall average 

7 


three classes—75 percent of the population—in either socioeconomic 
background or intelligence had similar poverty rates. But rhen the story 
diverges. As cognitive ability feil below average, poverty rose even more 
steeply among the cognitively disadvantaged than the socioeconomi- 
cally disadvantaged. bor the very dull, in the bottom 5 percent in IQ, 
30 percent were below the poverty line, fifteen times the rate for the 
people in the top cognitive dass. 

Taken one variable at a time, the data fit both hypotheses: Poverty 
is associated with socioeconomic disadvantage and even more strongly 
with cognitive disadvantage. Which is really explaining the relation- 
ship? And so we introduce a way of assessing the comparative roles of 
intelligence and socioeconomic background, which we will he using 
several times in the course of the subsequent chapters. 

We want to disentangle the comparative roles of cognitive abiliry and 
socioeconomic background in explaining poverty. The dependent vari¬ 
able, poverty, has just two possible values: Yes, the family had an income 
below the poverty line in 1989, or no, its income was above the poverty 


line. The Statistical method is a type of regression analysis specifically 
designed to estimate relationships for a yes-no kind of dependent vari¬ 
able. 191 In our first look at this question, we see how much poverty de- 
pends on three independent variables: IQ, age, and parental 
socioeconomic Status (hereafter called “parental SES”). The sample 
consists of all whites in the NLSY who were out of school in 1989. |10) 
We are asking a straightforward question: 

Given information ahout intelligence, socioeconomic Status, and age, 
what is our best estimate of the probability that a family was below the 
poverty line in 1989? 

for which a Computer, using the suitable Software, can provide an an¬ 
swer. Then we ask a second question: 

Takmg the other factors into account , how much remaining effect does any 
one of the independent variables have on the probability of being in 
poverty? 

for which the Computer can also provide an answer. 

When we apply these questions to the NLSY data, the figure below 
shows what emerges. First, age in itself is not important in determining 
whether someone is in poverty once the other factors of intelligence 
and parental family background are taken into account. 11 L! Statistically, 
its impact is negligible. 

This leaves us with the two competing explanations that prompted 
the analysis in the first place: the socioeconomic background in which 
the NLSY youth grew up, and his own IQ score. 

The black line lets you ask, “Imagine a person in the NLSY who 
comes frorn a family of exactly average socioeconomic background and 
exactly average age. 1121 What are this person’s chances of being in 
poverty if he is very smart? Very dumb?'” To find out his chances if he is 
smart, look toward the far right-hand part of the graph. A person with 
an IQ 2 SDs above the mean has an IQ of 130, which is higher than 98 
percent of the population. Reading across to the vertical axis on the left, 
that person has less than a 2 percent chance of being in poverty (always 
assuming that his socioeconomic background was average). Now think 
about someone who is far below average in cognitive ability, with an IQ 
2 SDs below the mean (an IQ of 70, higher than just 2 percent of the 
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The comparative roles of IQ and parental SES in determining 
whether young white adults are below the poverty line 

Probability of being in poverty 
30% - 



Note: Fot cornpunng the plot, age and either SES (tor the black curve) or IQ (tor rhe gray 
curve) were set at their mean values. 

Population). Look at the far left-hand part of the graph. Now, our imag- 
inary person with an average socioeconomic background has about a 26 
percent chance of being in poverty. The gray line lets you ask, “Imag- 
ine a person in the NLSY who is exactly average in IQ and age. What 
are this person’s chances of being in poverty if he came from an ex- 
tremely advantaged socioeconomic background? An extremely de- 


Refresher 

1/2 Standard deviation below and above the mean cuts off the Mst and 
69th percentiles. A 1/2 SD difference is substantial. 

1 Standard deviation below and ahove the mean cuts off the 16th and 
84th percentiles. A 1 SD difference is big. 

2 Standard deviations below and ahove the mean cuts off the 2d and 
98th percentiles. A 2 SD difference is very big. 

A “Standard score” means one that is expressed in terms of Standard de¬ 
viations, 
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prived socioeconomic background?” As the gray line indicates, theproh- 
ahility ot being in poverty rises if he was raised by parents who were low 
in socioeconomic Status , Hut only gradually. 

In general, the visual appearance of the graph lets you see quickly the 
result that emerges from a close analysis: Cognitive ahility is more im¬ 
portant than parental SES in determining poverty. 1 M| 

This does not mean that socioeconomic background is irrelevant. 
The magnitude of the eff ec t shown in the graph and its Statistical reg- 
ularity makes socioeconomic Status significant in a Statistical sense. To 
put it into policy terms, the Starting line remains unequal in American 
society, even among whites. On the other hand, the magnitude of rhe 
disadvantage is not as large as one might expect. For example, imagine 
a white person horn in 1961 who came from an unusually deprived so¬ 
cioeconomic background: parenrs who worked at the most menial of 
johs, often unemployed, neither of whom had a high school education 
(a description of what it means to have a socioeconomic Status index 
score in rhe 2d centile on socioeconomic dass). If that person has an IQ 
of 100—nothing special, just the national average—-the chance of 
falling below a poverty-level income in 1989 was 11 percent. It is not 
:ero, and it is not as small as the risk of poverty tor someone from a less 
punishing environment, but in many ways this is an astonishing State¬ 
ment of progress. Conversely, suppose that the person comes from the 
2d centile in IQ but his parents were average in socioeconomic Status— 
which means that his parents worked at skilled jobs, had at least fin- 
ished high school, and had an average income. Despite coming from 
that solid background, his odds of being in poverty are 26 percent, more 
than twice as great as the odds Pacing the person from a deprived home 
hur with average intelligence. 

In sum: Low intelligence means a comparatively high risk of poverty. 
If a white child of rhe next generation could he given a choice between 
being disadvantaged in socioeconomic Status or disadvantaged in Intel¬ 
ligence, there is no question about the right choice. 

Education 

Now let us consider whether education real ly exp!ains what is going on. 
One familiär hyporhesLs is that if you can only get people to stick with 
school long enough, they will be able to stay out of poverty even if they 
have modest test scores. 

As in subsequent chapters, we will consider two educational groups: 
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In the white high school sample, high IQ makes a difference in 
avoiding poverty; in the College sample, hardly anyone was poor 


Probability of being in poverty 

25%- 



Note: For Computing the plur, age and either SES (tor rhe black curve) nr IQ (for the gray 
eurve) were set at their mean values, 

white people with a high school degree (no more, no less) and those 
with a hachelor’s degree (no more, no less). The tigure ahove shows the 
results when the poverty rates for these two groups are considered sep¬ 
arate ly. 

First, look at the pair of Unesfor the College graduates. We show them 
only for values greater than the mean, to avoid nonsensical implications 
(such as showing predicted poverty rate for a College graduate with an 
IQ two Standard deviations below the mean). The basic lesson of rhe 
graph is that people who can complete a bachelor’s degree seldom end 
up poor, no matter what. This makes sense. Although income varies mv 
portantly for College graduates at different cognitive levels (as we dis- 
cussed in Chapters 2 through 4), the floor income is likely to be well 
above the poverty line. College has economic value independent of 
cognitive ability, whether as a credential, for the skills that are acquired, 
or as an indicator of personal qualities besides IQ (diligence, persis- 
tence) that make for economic success in life. It is impossible with these 
data to disentangle what contriburions these different explanations 
make. 


The two lines showing the results for high school. graduates are much 
more informative. These people are taking a homogeneous and modest 
set of educational skills to the workplace. Within this group, IQ has a 
strong effect independent of socioeconomic background. A young adult 
at the bottom 2 percent of IQ had about a 24 percent change of being 
in poverty compared to less than a 2 percent chance for one at the top 
2 percent of IQ (given average age and socioeconomic background, and 
just a high school diploma). The parenrs’ background made much less 
difference. Cognitive ability still has a major effect on poverty even 
within groups with identical education. 

COMPLICATING THE ISSUE: POVERTY AMONG CHILDREN 

How does the Information we have just presented help in trying to uiv 
derstand the narure of poverty in America / To illustrate, consider one 
of the most painful topics in recent American social policy, rhe grow- 
ing proportion of poor who consist of children. As of the 1991 figures, 
22 percent of all children under the age of 15 were below the official 
poverty line, twice as high as the poverty rate among those age 1 5 and 
over. IMI It is a scandalously high figure in a country as wealthy as the 
United States. Presumably every reader wishes for policies that would 
reduce poverty among children. 

W/ry are so many children in poverty in a rieh country? ln political 
dehate, the question is usually glossed over. An impression is conveyed 
that poverty among children is something that has grown everywhere 
in the United States, for all kinds of families, for reasons vaguely con¬ 
nected with economic troubles, ungenerous social policies during the 
1980s, and discrimination against women and minority groups. 

Specialists who have followed these figures know that this explana- 
tion is misleading. 1 *’ Poverty among children has always heen much 
higher in families headed by a single woman, whether she is divorced 
or never married. For families headed by a single woman, the poverty 
rate in 1991 was 36 percent; for all other American families, 6 percent. 1 ' 
Indeed, the national poverty rate for households headed by a single 
woman has been ahove 30 percent since official poverty figures hegan 
to he available in 1959. 17 The equation is brutal ly simple: The higher 
the proportion of children who live in households headed hy single 
women, then, ceteris paribus, the higher the proportion of children who 
will live in poverty. An important part of the increasing child poverty 
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in ehe United States is owed to the increasing proportion ot ehildren 
who live in those familiesT The political left and right ditfer in their 
views of what policies ro follow in response to this state of affairs, hut 
reeently they have hroadly agreed on the jo int roles of gender and 
changes in family structure in pushing up the figures for child poverty. 

Poverty Among Children: The Role of the Mothers IQ 

What does IQ add to this picture? It allows us to tocus sharpiy on who 
is poor and why, and to dispense with a numher of mistaken ideas. To 
see how, let us consider women, and specifically women with chib 
dren. |1M| Here is the graph that results when we ask how oben mothers 
withdiffering IQs and differing family structures suffer from poverty. (ln 
the figure, the effects of the mothers’ socioeconomie hackground are 
held constant, as are the numher of children, which is faetored into rhe 
calculation of the poverty line.) 

The first, giaring point of the figure is that marriage is a powerful 
poverty preventative, and this is true for women even of modest cogni- 

The role of the mother’s IQ in determining 
which white children are poor 


Probability of being in poverty as IQ goes from low to high 



No ecs: Fnr Computing rhe plot, agc and SES werc set at rheir mean values. 


tive ability. A married white woman with children who is markedly be- 
low average in cognitive ability—ar. the 16th centile, say, one Standard 
deviation below the mean—-from an average socioeconomie hack- 
ground had only a 10 percent probability of poverty. 

The seeond point of the graph is that to be without a husband in the 
house is to run a high risk of poverty, even if the woman was raised 
in an average socioeconomie hackground. Such a woman, with even 
an average IQ, ran a 33 percent chance of being in poverty. If she 
was unlucky enough to have an IQ of only 85, she had more than a 
SO percent chance—five rimes as high as the risk faced by a married 
woman of identical IQ and socioeconomie hackground. Even a woman 
with a conspicuously high IQ of 130 (two Standard deviations above 
the mean) was predicted to have a poverty rate of 10 percent if she 
was a single mother, which Ls quite high eompared to white women 
in general. Perhaps surprisingly, it did not make much difference which 
of the three kinds of “nonmarriage”—Separation, divorce, or no mar¬ 
riage at all—was involved. The results for all three groups of women 
were drastically different from the results for married women, and 
quite similar to each other (which is why they are grouped in the figure.) 

The third obvious conc lusion is that IQ is ext.remely important in de- 
rermining poverty among women without a hushand present. A poverty 
rate of 10 percent for women with IQs of 130 may be high eompared to 
some Standards, but it is tiny eompared to the steeply rising probahili- 
ties of poverty that characterize women with below average cognitive 
ability. 

Poverty Among Children: The Role of the Mothers Socioeconomie 
Background 

Now we pursue the same issue hut in terms of socioeconomie back- 
ground. Remernber that the steep downward curve in the figure above 
for unmarried mothers is rhe effect of IQ after Holding the effects of so- 
cioeconomic Status constant. What is the role of socioeconomie Back¬ 
ground after we take IQ into account? Not much, as the next figure 
shows. 

We used the same scale on the vertical axis in both of the preceding 
graphs to make the comparison with IQ easier. The conclu$ion is that 
no matter how rieh and well educated the parents of rhe mother might 
have been, a separated, divorced, or never-married white woman with 
children and an average IQ was still looking at nearly a 30 percent 
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The role of the mother’s socioeconomic background in 
determining which white children are poor 


levels of intelligence much harder than they do those at high levels of 
intelligence—even after socioeconomic background is held constanr. 


Probability of being in poverty as 
parental SES goes from low to high 
70% - 


HOLDING BOTH COMPLICATIONS AND POLICY THOUGHTS 
ATBAY 
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White mothers who are separated , 
divorced, or never rrutrried 



Married white mothers 
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Note: For o unput in^ rhe plor, age nnd IQ wert* sei ar rheir mean values. 


chance of being helow the poverty line, far above the usual level for 
whires and far above the level facing a woman of average socioeconomic 
background hut superior IQ. We cannot even be sure that higher so- 
cioeconomic background recluces the poverty rate at all for unmarried 
women after the contribution of IQ has been extracted; the dovvnward 
slope of the line plotted in the graph does not approach Statistical sig- 
nificanctc 1 ^ 1 

There are few clearer arguments for bringing cognitive ability into 
the analysis of social problems. Consider the hundreds of articles writ- 
ten about poverty among children and about the effects of single-par- 
ent families on poverty. Ofcourse, these are important factors: Children 
are more often poor than adults. Family breakup is responsihle for a ma- 
jor portion of the increase in child poverty. But if analysts are trying to 
understand the high rates of poverty among children, it must be done 
against the background that whatever other factors increase the risk of 
poverty among unmarried mothers, they hit unmarried mothers at low 


You have been following a common process in social Science. An ini- 
tially simple issue hecomes suceessively more complicared. And we have 
bare ly gor ten started—an analysis in a teehnical journal seldom has as 
few independent variables as the ones we have examined. For that mat¬ 
ter, even rhis simplified analysis represents only the end result of a long 
process. ln the attached note, we describe how big the rest of the ice- 
berg is. 1 * 11 

Complex analysis has both merits and faults. The merit is that the 
complications are part of reahty. Einstein’s injunction that Solutions 
should be as simple as possible, but no simpler, still applies. At the same 
time, social science offen seems more in need of the inverse injunction, 
ro inrroduce as much complexity as necessary, but no more. Complica¬ 
tions can make us forget; whar we were trying to understand in the first 
place. Here is where we believe the Situation Stands: 

By complicating the picture, we raise additional questions: Education 
is important in affecting poverty; the appropriate next stop is to explore 
how intelligence and socioeconomic Status are related to years ot edu- 
cation. Marriage is important in determining poverty; we should explore 
how intelligence and socioeconomic Status are related to marriage. 
These things we shall do in subsequent chapters. 

But the simple picture, with only IQ, parental SES, and age in the 
equation, resrricted ro our all-white sample, continues to teil a story of 
its own. A major theme in the public dialoguc in the United States has 
been that socioeconomic disad van tage is the primary driving force be- 
hind poverty. The simple picture shows that it just isn’t so for whires.*“ 1 
The high rates of poverty that afflict certain segments of the white pop- 
ulation are determined more by intelligence than by socioeconomic 
background. The force and relevance of this Statement does not seem 
to us diminished by the complications it does not embrace. 

lndeed, now that we are returning to basics, 1er us remember some- 
thing eise that could he overlooked in the weiter of regression analyses. 
The poverty rate for whites in dass V was 30 percent—a percentage 
usually associated with poverty in poor urban neighhorhoods. Ethnically 
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and culturally, rhese are supposed to be the advantaged Americans: 
whites of European descent. But they have one big thing working against 
tbem: they are not very smart. 

Like many other disahilities, low intelligence is not the fault of the 
individual. Everything we know about the causes of cognitive ability, 
genetic and environmental, teils us that by the time people grow to an 
age at which they can he considered responsible moral agents, their IQ 
is fairly well set. Many readers will find that, before writing another 
word, we have already made the case for sweeping policy changes meant 
to rectify what can only be interpreted as a palpably unfair result. 

And yet hetween this and the chapters that will explore those policy 
issues Stretch a few hundred pages of intervening analysis. There is a rea- 
son for them. By adding poverty to the portrait of cognitive stratifica- 
non descrihed in Part I, we hope to have set the terms of a larger problem 
than income inequality. The issue is not simply how people who are poor 
through no fault of their own can he made not poor but how we—all of 
us, of all ahilities and income levels—can live together in a sociery in 
which all of us can pursue happiness. Changing policy in ways that ab 
fect poverty rates may well be part ofthat solution. But as we observed 
ar the outset of the chapter, poverty itself has been declining as various 
disconrenrs have been rising during this Century, and curing poverty is 
not necessarily going to do much to eure the other pains that afflict 
American sociery. This chapters analysis should establish that the tnv 
Jitional socioeconomic analysis of the origins of poverty is inadequate 
and that intelligence plays a crucial role. We are just at the beginning 
ot understanding how intelligence interacts with the other problems in 
Americas crisis. 


Chapter 6 

Schooling 


schon/ before getting a high schon/ diploma in the old days was usu- 
ally not a sign o f failure. The youngster heul not dropped out but simply rnoved 
07i. As Lue as 19 40, fetver than half of 18-year-olds got a high school diploma. 
But in the postwar era, the high school diploma became the nonn. Nottc not 
ha vi ng one is a social disabilky of so me gravity. 

The usual picture of high school dropouts fncuses on their socioeconomic 
circumstances. h is true that most of them are from poor families, but the re - 
Jationshi/') of socioeconomics to school dropout is not simple. Among whites , 
a/most no one with an IQ in the top quarter of the distrihution fails to get a 
high school education , no 7 narter how poor their families . Dropout is extremely 
rare throughout the upper half of the IQ distributum. Socioeconomic back- 
ground has its most powerful effect at the lowest end of the social Spectrum , 
among studenrs who are already below average in intelligence. Beingpoor has 
a small effect on tiroppmg out of school indepentlent of IQ; it has a sizable m- 
dependent effect on whether a person finishes school with a regulär diploma 
or a high schon/ equivalency certificate. 

To raise the chances of getting a College degree , it helps to he in the upper 
half of the distrihution for either IQ or socioeconomic Status. But the aclvan- 
tage of a high IQ outweighs that of high Status. Similarly, the disadvantage of 
a low IQ outweighs that of low Status. Youngsters from poor backgrounds with 
high IQs are likely to get through College these days , but those with low IQs , 
even if they come from welhtoalo backgrounds, are not. 


O f all the social behaviors that might he linked to cognitive ability, 
school dropout prior to high school graduation is the most ohvi- 
ous, Low intelligence is one of the best predictors ot school failure, and 
students who fail a grade or two are likely to have the least attachment 
to school. And yet this relationship, as strong as it is now, is also new. 
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The very eoncept of school failure is a modern invenrion. In the era of 
rhe one-room schoolhause, students advanced at their own pace. There 
were nt) Normal grade levels, no promotions to the next grade, henee no 
way to fail. 1 

“Dropping out” is an even more recent concept, created by the as- 
sumption that it is normal to remain in school through age 17. Until re- 
cently, it wasn’t typical. ln 1900, the high school diploma was the 
preserve of a tiny minority of American youth: The numher of those 
who got one amounted to only 6 percent of the crop of potential seniors 
that year. This figure, known as the graduation ratio , is calculated as the 
percentage of the 17'yeanold population.“ Perhaps even more startling, 
it was not until the beginning of World War 11 that the graduation ra¬ 
tio first passed the 50 percent mark. The figure shows the story from 
1900 to 1990 

The trendlines that overlie the data indicate two broad phases in this 
ninety-year history. The first phase, from 1908 until the early 1920s, fea- 
tured moderate expansion of high school cducation. It did not appear 
moderate at the time—the graduation rate more than doubled from 
1900 to 1922—but the growth was nonetheless moderate by compari- 
son with steep surge from 1922 until the beginning of World War 11. 

In the first half of the Century, the 
high school diploma becomes the norm 


Graduation ratio 
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This was the opening of the second growth phase, which lasted, with 
an interruption for World War II, until 1964. The story since 1964 has 
heen mixed. Graduation rates stalled during the last half of the 1960s 
and then reversed during the 1970s. The trend since 1980 has heen un- 
certainly and shallowly upward. As of 1992, the graduation ratio for 17' 
yeanolds stood at 76 percent, near the 1969 high of 77 percent. The 
Proportion of people who eventually graduate or get a high school equiv' 
alency certificate now Stands at about 86 percent for the population as 
a whole. 14 ' 

Americans today take it for granted that the goal is to graduate every' 
one and that a high school dropout rate is a social evil. But earlier 
thinkers, even those in our liberal tradition, were dubious ahout edu- 
cating the entire population beyond the rudiments of literacy. Voltaire’s 
view that “the lower classes should be guided, not educated,” was typi¬ 
cal until this Century. 5 Even early in this Century, many ohservers feared 
that unqualified youngsters were being educated beyond their abilities. 
“We must turn hack the clock,” one prominent educator wrote in 1936, 
“to take some five million boys and girls from the educational dole.” Cl 

And yet when the psychometricians sought to document the fear that 
the country was trying to educate the ineducable, they found little ev- 
idence for it. One investigator, Frank Finch, assemhled all of the com- 
petent studies of the Intelligence of high school students conducted 
from 1916 (rhe earliest study he could find) to 1942. The mean IQ of 
ninth graders in these studies was 105; the mean IQ of the twelfth 
graders or graduates was 107, trivially different.' 71 The data suggest that 
the large numher of youngsters who dropped out between ninth grade 
and high school graduation averaged less than 105 in IQ, but not by 
much (a ealcuhmon explained in the note).* 81 

Finch found no increasing trend over time in the IQ gap between 
dropouts and graduates during the early part of the Century. Replicating 
the story that we described regarding the College level in Chapter 1, the 
first decades of the Century saw American high school education mush- 
room in size without having to dip much deeper into the intellectual 
pool. This process could not go on forever. As the high school diploma 
became the norm, the dropouts were likely to hecome more self-selected 
for low IQ, and so indeed it transpired. 

We have not heen ahle to determine exactly when the gap between 
nongraduares and graduates began to open up. Probably it was widen- 
ing even by the early 1940s. By the early 1950s, a study in Iowa found 
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a ten-point gap in IQ hetween dropouts and high school graduates.An- 
other study, in 1949, of 2,600 students who had been given an IQ test 
in the seventh grade, found a gap hetween the graduates and nongrad- 
uates of about thirteen IQ points, close to the IQ’s Standard dcviation 
of 15. 10 The proportion of students getting a high school diploma had 
reached about 55 percent by then. By the spring of 1960, when 70 per- 
cent of students were graduating, the data from Project T ALENT—the 
large, nationally representative sample of high school students men- 
tioned in Chapter 1—indicate a gap äquivalent to almost sixteen IQ 
points hetween the academic aptitude of those who graduated and those 
who did not, slightly more than a Standard dcviation. 1111 This is tanta- 
mount to saying that the average dropout had an IQ that put him at the 
15th centile of those who graduated. 

The Situation seems to have remained roughly the same since then. By 
the Standard current definition of the population that “gets a high school 
education”—meaning either a diploma or by passing an equivalency ex- 
amination—the NLSY data reveal that the mean score of those who get 
a high school education is 1.28 Standard deviations higher than those 
who do not. Comparing those who get the ordinary high school diploma 
with all those who left high school before doing so (including those who 
later get an equivalency certificate), the gap is 1.02 Standard deviations. 

WHITE HIGH SCHOOL DROPOUT IN THE NLSY 

Who drops out of high school these days? The following tahle shows 
the story for NLSY whites in the various cognitive classes. The results 


Failure to Get a High School 

Education Among Whites 

Cognitive Class 

Percentage Who Did Not 
Graduate or Pass a High 
School Equivalency Exam 

l Very bright 

0 

II Bright 

0* 

III Normal 

6 

[V Du 11 

35 

V Very dull 

55 

Overall average 

9 

1 The actual figure w; 

us 0.4 percent. 


could hardly be starker. Among whites in the top quartile (Classes I and 
II together), virtually everyone got a high school education. In the bot- 
tom quartile of the IQ distribution (Classes IV and V together), 39 per¬ 
cent of whites did notJ 121 This huge discrepancy is also predictahle, 
however, given the close relationship hetween IQ and educational at- 
tainment;—so predictahle that we should pause for a moment before 
viewing dropout rates with alarm. Is a 39 percent dropout rate for stu¬ 
dents in the lowest quartile of IQ “high”? From one perspective, it seems 
so, considering how essential education appears to be for making a liv- 
ing. From another perspective, it is remarkable that over 60 percent of 
white youths with IQs under 90 did get a high school education. It is 
particularly remarkable that nearly half of the youths in Class V, with 
IQs of 75 and under, completed a high school education, despite heing 
on the borderline (or beyond) of the clinical definition of retarded. |n! 
Whether these figures say something about the ability of low-lQ stu¬ 
dents to learn or about the state of American secondary education is a 
topic we defer until Chapter 18. 

What Does “A High School Education * M ean? 

The Standard question now arises: To what extern are we looking at an 
effect of cognitive ability, and to what extent are white children from 
poor socioeconomic backgrounds being shunted out of the school System 
because of their backgrounds? The answer depends on exactly how the 
question is asked. Specifically, it is important to be precise about what “a 
high school education” means. ln the table above, it was defined to in- 
clude anyone who graduated from high school in the normal way or who 
passed an equivalency examination, known generically as a GED (for 
General Educational Development). 1 * 4 This has become nearly Standard 
practice when researchers and journalists alike talk about high school 
dropout. But recent work by economists Steven Cameron and James 
Heckman has demonstrated that GED youths are not equivalent to “nor¬ 
mal” graduates in terms of their success in the job market. 1 ' In their un- 
employment rates, job tenure, and wages, the GEDs look more like 
dropouts than they look like high school graduates, raising the possibil- 
ity that they differ from other high school graduates in a variety of ways 
that makes it dangerous to lump all people with “a high school educa¬ 
tion” into a single group. We know from our own analyses that the white 
GEDs in the NLSY had an average IQ half a Standard deviation lower 
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than the average for white high school graduates. Furthermore, apart 
from the specifics of the data, it is apparent that the nature of the GED 
student’s behavior—giving up on school, then later returning to pass the 
examination—is different in kind from that of both the dropout who 
leaves school and never goes back, and from that of the youth who sticks 
with four cotivsecutive years of schooling and gets a diploma. 

To clinch their case for separating GED from “normal” graduates, 
Cameron and Heckman also point out that the size of the GED 
population, once negligible, has grown to become a substantial minor- 
ity. In 1968, GED graduates accounted for only 5 percent of all high 
school certifications. By 1980, that proportion had reached more than 
13 percent, where it has remained, with minor fluctuations, ever since. 1161 

We are persuaded that these disparate groups need to be separated 
and will therefore analyze separately the relationship of IQ and socio- 
economic background to each of these two types of dropouts. 

The Permanent Dropouts 

First, we compare students who got a high school degree through the 
normal process with dropouts who left school never to return, shown in 
the next figure. 

Staying through high school to receive a diploma did not require 
genius or high^status parents. Dropout rates were extremely low for 
white students who were of at least average mtelligence or socio- 
economic background. But dropout rates rose rapid ly when those varn 
ables feil below average, with the rise being precipitous for students 
with low IQ. 

A closer look at these numbers dispels the stereotype of the high 
school dropout as the bright but unlucky youngster whose talents are 
wasted because of economic disadvantage or a school System that can- 
not hold onto him—the stereotype that people have in mind when they 
lament the American dropout rate because it is frittering away the na- 
tion’s human Capital. 1173 Among whites, hardly anyone in the NLSY fit 
that description. Of the whites who dropped out never to retum, only 
three-tenths of 1 percent met a realistic definition of the gifted-but-dis- 
advantaged dropout (top quartile of IQ, bottom quartile of socioeco- 
nomic background.) Another eight-tenths of 1 percent were in the top 
quartile of IQ and the third quartile of the socioeconomic distribution. 


ln predicting which white youths will never complete a 
high school education, IQ is more important than SES 


Probability of permanently dropping out of high school 
70% - 



Note: Hör computing the plot, nge und either SES (for the black curve) or IQ (for the gray 
curve) were set at their mean values. 


Even when we relax the definition to include everyone who is from the 
tüp half of the IQ distribution and the bottom half of the socioeconomic 
distribution—a very loose definition indeed—we are talking ahout a 
grand total of only 5.5 percent of the permanent dropouts, or half of 1 
percent of American whites in the NLSY. ,1S| 

The permanent dropout instead fits the older image, more common 
among the general public than intellectuals, of the youngster who is 
both not very smart and from the wrong side of the tracks. To put it 
technically, the effects of socioeconomic Status and intelligence inrer- 
act. A white youth who had both low cognitive ability and a poor so- 
cioeconomic background was at even more risk of dropout than the 
separate effects of each variable would lead one to expectJ ' Of white 
youths who were in the bottom quartile on both IQ and socioeconomic 
Status, half permanently dropped out of school. 
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The Temporary Dropouts 

The temporary dropouts,” who go back to get a GED, teil a different 
story. In the figure below, they are compared with students who re- 
ceived a high school diploma in the usual way. In effect, the figure says 

For temporary dropouts, the importance of SES increases sharply 

Probability of getting a GED instead of a high school diploma 
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that if you want to predict who will stay in high school through the 
diploma, and who will instead drop out of school and eventually get a 
GED, you are better off sizing up their parents than looking at their IQ 
scores. ln speculating about what lies behind these numbers, three im- 
ages come to mind. First, there are middle- and the upper-class parents 
who find it unthinkable that their children should drop out of high 
school—call the therapist, find a special school, do anything, but keep 
the child in school. Then one thinks of working-class parents (most of 
whom are somewhere around the mean on the socioeconomic index), 
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urging their children to get an educarion and do better than their par- 
ents. Finally, one thinks of lower-ciass parents, the Pap Finns of Amer¬ 
ican folklore, complaining about their children wasting all that time 
on hook learning. The NLSY data are consistent with these populär 
images. For youths with a socioeconomic hackground anywhere near 
or above the mean, the high school diploma is the norm. As socio¬ 
economic hackground falls below the mean, the probability that the 
high school certification came through a GED instead of the normal 
route soars. 

This view also firs into the Oameron and Heckman finding that 
GED students are more like dropouts than high school graduates in the 
Problems they experience in the lahor market. Interpretively, the 
brighter dropouts may go hack to get a GED, but they continue to share 
in common with the permanent dropouts a lower-ciass social hack¬ 
ground that has not inculeated a work ethic that makes for success in 
the lahor force. 1 * 01 Thus, GEDs are more like normal graduates in their 
intelligente but more like other dropouts in their success in the lahor 
force. 

All of this Interpretation is speculative, and we will leave it to oth- 
ers to determine whether these possibilities stand up to examination. 
Meanvvhile, the results ernphasize the need tor more open exploration 
of a ropic that has heen almost as taboo in some circles as IQ: the pos- 
sibility that “lower dass” in its old-tashioned sense has an impact on 
how people hehave. 

One concrete result of this analysis bears on the presentation in this 
hook. The differences between GED graduates and those with regulär 
diplomas are too great to justify grouping them together. Whenever we 
refer to “a high school education” throughout the rest of Part II, we are 
referring specifically to the normal high school career, completed hy a 
diploma. GED graduates are excluded. 

THE COMPARATIVE ROLE OF IQ AND FAMILY BACKGROUND 
IN GETTING A COLLEGE DEGREE 

As a general Statement, the relationship of IQ to educational attain- 
ment seems to have heen remarkably stähle. Twenty years ago, one of 
the leading texts on the Wechsler Adult Intelligence Scale reported that 
the mean of high school graduates was ahout 105, the mean of College 
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graduates was 115, and the mean ofpeople getting medical degrees and 
Ph.D.s was about 125. 21 The book, published in 1972, wasbased on clin- 
ical experience in the 1950s and 1960s. This summary is virtually iden- 
tical to the story told by the NLSY for whites (who correspond most 
closely with the College population in the 1950s and early 1960s). The 
mean IQ of high school graduates was 106, the mean of College gradu- 
ates was 116, and the mean ot people with Professional degrees was 126. 
The relative roles of socioeconomic Status and IQ in getting a bache- 
lor’s degree for youths of the late 1970s and 1980s are shown in the fig- 
ure below. 

For white youths, being smart is more important 
than being privileged in getting a College degree 


Probability of getting a bachelor’s degree 

80 % - 



Note: For Computing the plot, age and either SES (for the black curve) or IQ (for the «ray 
curvc) were .ser ar cheir mean values.. 


Two broad implicacions of these results stand out. The first is sug- 
gested by the way that both curves hug the bottom throughour the left- 
hand side of the graph. The combination of average-or-helow parental 
SES or average-or-below IQ meant that the odds of getting a College de¬ 


gree are minuscule. The second broad implication is that parental SES 
is important hut not decisive. In terms of this figure, a Student with very 
well-plaeed parents, in the top 2 percent of the socioeconomic scale, 
had oniy a 40 percent chance of getting a College degree if he had onlv 
average intelligence. A Student with parents of only average SES— 
lower middle dass, probably without College degrees themselves—who 
is himself in the top 2 percent of IQ had more than a 75 percent chance 
of getting a degree. 

Once again, the common stereotype of the talented-but-disadvan- 
taged-youth-denied-educational-opportunity does not seem to exist in 
significant numbers any longer. Only seven-renths ot I percent of whites 
in the NLSY were both “prime College material” (lQs of 115 or above) 
and markedly disadvantaged in their socioeconomic hackground (in the 
bottom quartile on the SES index). Among this tiny group, it is true 
that fewer than half (46 percent) got College degrees. Those who did 
not, despite having high IQs, may he seen as youths who suffered from 
having a disadvantaged hackground. But recall that this group consists 
ot only tour-tenths of 1 percent of all white youths. A category of wor- 
thy white young persons denied a College educarion because of circuin¬ 
stantes surely exists to some degree, hut of such small size that it does 
not constitute a public policy problem. 

What about another stereotype, the untalented child of rieh parents 
who gets shepherded through to a degree? Almost 5 percent ot white 
youths had helow-average IQs (under 100) and parents in the top quar¬ 
tile of socioeconomic Status. Of those, only 12 percent had gotten Col¬ 
lege degrees, representing just six-tenths of 1 percent of white youths. 
Judging from these data, the common assertion that privileged white 
parents can make sure their children do well in school, no matter what, 
may be exaggerated. 

SUMMING UP 

The act of leaving high school hefore graduating is a rare event among 
whire youths, conspicuously concentrated in the lovvest quartile of cog- 
nitive ability. Among those who drop out, both socioeconomic Status 
and cognitive ability are involved. Most dropouts with above-average 
intelligence go back to get a GED. 1 “ 21 But socioeconomic Status remains 
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bound up with the dropout process. The children of lower-class families 
are more likely to end up with a GED than are the children of average 
or upper-class families. There is irony in this: Throughout Part II, we 
describe social Problems that are more understandable once cognitive 
ability is brought into the picture and for which socioeconomic hack' 
ground is not as important as most people think. But the one social prob- 
lern that has a widely acknowledged cause in cognitive ability—school 
dropout—also has a strong and complex socioeconomic link. 

When it comes to explaining who gets a College education among 
whites, hoth academic merit and socioeconomic hackground play im¬ 
portant roles. But while socioeconomic privilege can help if the young' 
ster is reasonably bright, there are limits to what it can do if he is not. 
And ifcognitive ability is high, socioeconomic disadvantage is no longer 
a significant harrier to getting a College degree. 


Chapter 7 

Unemployment, Idleness, and 
Injury 

Econonusts distmgufs/i between being unemployed and being out of the labor 
force. The unemployed are lookingfor work unsuccessfully. These out of the 
labor force are not looking, at least for the time being. Among young white 
men in their late 20s atid early 30s, both unemployment and being out of the 
labor force are strongly predicted by low cognitive ability, even after taking 
other factors into account. 

M any of the white males in the NLSY who were out of the labor force had 
the obvtous excuse : They were still in College or graduate school. Of thnse not 
in school , 15 percent spent at least a month out of the labor force in 1989. 
The Irroportion was more than twice as high in cognitive Class V as in Class 
I. Socioeconomic hackground was not the explanation. After the effects of IQ 
were take n into account , the probability of speruding time out of the labor force 
went up, not down, as parental SES rose. 

W7ry are young men out ofthe labor force? One ohvious possihiluy is phys- 
ical disability. Yet here too cognitive ability is a sträng fnedictor: Of the men 
who described themselves as being too disabled to work, more than nine out 
of ten were in the bottom quarter of the IQ distribution; fewer than one in 
twenty were in the top quarter . A man’s JQ predicted whether he described 
himself as disabled better than the kinds of Job he had held. We do not know 
why intelligence and physical Jnohlems are so closely related, but one possi- 
bility is that less intelligent people are more accident prone. 

The results are similar for unemployment. Among young white men who 
were in the labor market, the likelihood of unemployment for high school grad- 
uates and College graduates was equally dependent on cognitive ability. So¬ 
cioeconomic hackground was irrelevant once intelligence was taken into 
account. 

Most men, whatever their intelligence , are working steadily. However, for 
that minority of men who are either out of the labor force or unemployed, the 
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primary riskfactor seems to be ncither sociocconomic background nor educa - 
tion but loiv cogrative ability. 


H aving a high IQ makes it easier to do well in a joh; vve follovved 
that story in Chapter 3. Bur what about the relationship of cogni- 
tive ability to that crueially important social behavior known as “being 
able to get and hold a job.” To what exrent are dropouts from the labor 
force concentrated in the lowTQ classes? To what extern are the um 
employed concentrated the re? 

In the following discussion, we limit the analysis to males. 1t is still 
accepted that women enter and leave the labor force for reasons hav- 
ing to do with home and Family, introducing a large and complex set of 
issues, whereas healthy adult men are still expected to work. And yet 
something troubling has been happening in that area, and for a long 
time. The problem is shown in the figure helow for a group of young 
men who are likely to be (on average) in the lower half of the IQ dis- 
tribution: men 16 to 19 years who are not enrolled in school. 

Since mkheentury, teenage boys not in school are 
increasingly not employed either 


Employment among men ages 16-19 who are not in school 
90% - 



1954 1959 1964 1969 1974 1979 1984 1989 


Sourcey. Bureau of Labor Statistics, 1982, Table C-42; unpuhlished data provided by the Bu¬ 
reau of Labor Statistics. 
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Alrhough the economy has gone up and down over the last forty years 
and rhe employment of these young men with it, the long-term em- 
ploymenr trend of their employment has been downhill. The Overall 
drop has nor heen small. In 1953, rhe first year for which data are avaib 
able, more than 86 percent of these young men had jobs. ln 1992, it was 
just 66 percent. 

Large mac.roeconomic and macrosocial forces, which we will nor try 
to cover, have heen associated with this trend in employment. 111 ln this 
chapter, we are concerned with what Intelligence now has to do with 
getting and holding a job. Tb explore the answer, we divido the em- 
ployment problem into its two constituent parts, the unernployed and 
those not even looking for work. All of the analyses that follow referex- 
clusively to whites; in this case white males. 

LABOR FORCE DROPOUT 

To qualify as “participating in the labor force,” it is not neeessary to be 
employed; it is neeessary only to he looking for work. Seen from this 
perspective, there are only a few valid reasons why a man might nor be 
in the labor force. Ile might be a fulbtime Student; disabled; institu- 
tionalized or in the armed forces; retired; independently wealrhy; stay- 
ing at home caring for the chiLiren while bis wife makes a salary. Or, it 
may be argued, a man may legirimately he out of the labor force if he is 
convinced that he eannot find a joh even if he tries. But this comes close 
to exhausting the list of legititnate reasons. 

As of the 1990 interview wave, the members of the NLSY sample 
were in an ideal position for assessing labor force parricipation. They 
were 25 to 33 years old, in their prime working years, and they were in- 
deed a hardworking group. Ninery-three percent of them had jobs. 
Fe wer than 5 percent were out of the labor force altogether. What had 
eaused that small minority to drop out ot the labor force? And was there 
any relarionship berween being out of the labor force and intelligence? 

One such relationship was entirely predictable. A few men were out 
of the labor force because they were still in school in their late 20s and 
early 30s—most of them in law school, medical school, or studying for 
rhe doctorate. They were concentrated in the top cognitive classes. But 
this does not teil us much about who leaves the labor force. We will ex- 
clude them from the subsequent analysis and focus on men who were 
out of the labor force for reasons other than school. 
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To structure the analysis, let us ask who spent at least a month out of 
the labor force during calendar year 1989. Here is the breakdown of la- 
bor force dropout by cognitive dass for white males. 121 Dropout from the 
labor force rose as cognitive ability feil. The percentage of Class V men 


Which White Young Men Spent a 
Month or More Out of the Labor 

Force in 

1989? 

Cognitive Class 

Percentage 

I Very bright 

10 

11 Bright 

14 

111 Normal 

15 

IV Dull 

19 

V Very dull 

22 

Overall average 

15 


who were out of the labor force was a little more than twice the per¬ 
centage for men in Class 1. 

SoaoEcoNOMic Backorouno Vrrsus Gx^hitivf Ability. The nexr 
Step, in line with our Standard procedure, is to examine how much of 
the difference mav be accounted for by the man’s socioeconomic 
background. The thing to be explained (the dependent variable) is 
the probability of spending at least a month out of the labor force in 
1989. Our basic analysis has the usual three explanatory variables: 
parental SES, age, and IQ. The results are shown in the figure below. 
In this analysis, we exclude all men who in either 1989 or 1990 
reported that they were in sehool, the military, or were physically 
unahle to work. 

These results are the first example of a phenomenon you will see again 
in the chapters of Part II. If we had run this analysis with just socioeco- 
nomic background and age as the explanatory variables, we would have 
found a mildly interesting but unsurprising result: Holding age constant, 
white men from more privileged hackgrounds have a modestly smaller 
chance of dropping out of the labor force than white men from deprived 


IQ and socioeconomic background have opposite effects 
on leaving the labor force among white men 


Probability of being out of the labor force for a month or more 
2(Y/( — 



Very low Very high 

(-2 SDs) ( + 2 SDs) 

Note: For eompurinj» the plor, age and either SES (for the hlaek eurve) or JQ (for the gray 
curve) were sei ar rheir mean valutw 


hackgrounds. Bur when IQ is added to the equation, the role of socio- 
economic background either disappears cntirely or moves in the oppo- 
site direction. Uiven equal age and IQ, a young man from a family with 
high socioeconomic Status was more likely to spend time out of the labor 
force than the young man from a family with low socioeconomic Status. 1 * 1 
ln contrasr, IQ had a large positive impact on staying at work. A man 
of average age and socioeconomic background in the 2d centile of IQ 
had almost a 20 percent chance of spending at least a month out of the 
labor force, compared to only a 5 percent chance for a man at the 98th 
centile. 

1t is not hard to imagine why high Intelligence helps keep a man at 
work. As Chapter 3 diseussed, competence in the workplace is related 
to intelligence, and competent people more than incompetent people 
are likely to find the workplace a congenial and rewarding place. Hence, 
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other things equal, they are more likely than incompetent people to he 
in the labor force. Intelligence is also related to time horizons. A male 
in bis 20s has rnany diverting ways to spend his time, from traveling the 
world to seeing how many women he can romance, all of them a lot 
more fun than working forty hours a week at a joh. A shortsighted man 
may he tempted to take a few months off here and there; he thinks he 
can always pick up again when he feels like it. A farsighted man teils 
himself that if he wants to lay the groundwork for a secure future, he 
had hetter estahlish a record as a reliahle employee now, while he is 
young. Statistically, smart men tend to he more farsighted than dumh 
men. 

ln conrrast to IQ, the role of parental SES is inherently amhiguous. 
One possihility is that growing up in a privileged home foretells low 
dropout rates, because the parents in such households socialize their sons 
to conventional work. But rhis relationship may hreak down among the 
wealthy, whose son has the option of living comfortably without a 
weekly paycheck. ln any case, aren’t working-class homes also adamant 
about raising sons to go out and get a joh? And don’t young men from 
lowenclass homes have a strong economic ineentive to stay in the labor 
force because they are likely to need the money? The Statistical 
relationship with parental SES that shows up in the analysis suggests 
that higher Status may facilitate lahor force dropout, at least for short 
periods. 

The analysis of lahor force dropout is also the first example in Part 11 
ot a significant relationship that is nonetheless modest. When we know 
from the outset that 78 percent of white men in dass V—borderline 
retarded or helow—did not drop out of the lahor force for as much as a 
month, we can also infer that all sorts of things besides IQ are impor¬ 
tant in determining whether someone stays at work. The analysis we 
have presented adds to our understanding without enabling us to ex- 
plain fully the phenomenon of labor force dropout. 

EnuCATION. Conducting the analysis separately for our two educational 
samples (those with a bachelor’s degree, no more and no less, and those 
with a high school diploma, no more and no less) does not change the 
picture. High intelligence played a larger independent role in reducing 
lahor force dropout among the College sample than among the high 
school sample. And for hoth samples, high socioeconomic hackground 
did not decrease lahor force dropout independent of IQ and age. Once 


again, the probability of dropout actually increased with socioeconomic 
hackground. 

JOB DISABILITIES 

ln the preceding analysis, we excluded all the cases in which men re^ 
ported that they were unahle to work. But it is not that simple. Low cog- 
nitive ahüity increases the risk of being out of the lahor force for healthy 
young men, but it also increases the risk of not: heing healthy. The hreak' 
down by cognitive classes is shown in the following table. The rela- 


Job Disability Among Young White Males 


No. per 1,000 


No. per 1,000 Who 

Who Reported Being 


Reported Limits in 

Prevented from 


Amount or Kind of 

Working by Health 

Cognitive 

Work by Health 

Problems 

Class 

Problems 

0 

1 Very Bright 

13 

5 

11 Bright 

21 

5 

111 Normal 

37 

36 

IV Du 11 

45 

78 

V Very Juli 

62 

11 

Overall average 

33 


tionship ot IQ with hoth variables is conspicuous but more dramatic for 
men reporting that their disability prevents them from working. The 
rate per 1,000 of men who said they were prevented from working hy a 
physical disability jumped sevenfold from Class 111 to Class IV, and then 
more than doubled again from Class IV to Class V. 

A moment s thought suggests a plausible explanation: Men with low 
intelligence work primarily in blue-collar, manual johs and thus are 
more likely to get hurt than are men sitting around Conference tables. 
Being injured is more likely to shrink the joh market for a blue-collar 
worker than a for a white-collar worker. An executive with a limp can 
still be an executive; a manual laborer with a limp faces a more serious 
joh impediment. This plausible hypothesis appears to be modestly coiv 
firmed in a simple cross^classification of disabilities with type of joh. 
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More blue-collar workers reported some health limitation than did 
white-collar workers (38 per 1,000 versus 28 per 1,000), and more blue' 
collar workers reported being prevented from working than did whitt> 
collar workers (5 per 1,000 versus 2 per 1,000). 

But the explanation falls to account lor ehe relarionship of disability 
with intelligence. Forexample, given average cognitive ability and age, 
the odds of having reported a job limitation because of health were about 
3.3 percent for white men working in white'Collar jobs compared to 3.8 
percent for white men working in blue-collar jobs, a very minor differ- 
ence. But given that both men have blue-collar jobs, the inan with an IQ 
of 85 had double the prohahility of a work disability of a man with an 
IQ of 115. 

Might there be something within job categories to explain away this 
apparent relationship of IQ to job disability? We explored the quesiion 
froin many angles, as described in the extended note, and the tindmg 
seems to be robust. For whatever reasons, white men with low IQs are 
more Likely to report being unable to work because of health than their 
smarter counterparts, even when the occupational hazards have been 
similar. H 

Why might intelligence be related to disability, independent of the 
line of work itself? An answer leaps to mind: The smarter you are, the 
less likely that. you will have aecidents. ln Lewis Termans sample of peo- 
ple with IQs above 140 (see Chapter 2), aecidents were well he low the 
level observed in the general population. 5 ln other studies, the risk oi 
motor vehicle aecidents rises as the driver’s IQ falls/' Level of educa- 
tion—to some degree, a proxy measure of intelligence—has been linked 
to aecidents and injury, ineluding fatal injury, in other activities as well.' 
Smarter workers are typically more productive workers (see Part 1), and 
we can presume that some portion of what makes a worker productive 
is that he avoids needless aecidents. 

Whatever validity this explanation may have, however, it is unlikely 
to be the whole story. We will simply observe that selLreported health 
Problems are subject to a variety of biases, especially when the question 
is so sensitive as one that asks, in effect, “What is your excuse for not 
looking for a job, young man?“ The evidence in the NLSY regarding the 
seriousness of the ailments, whether a doctor has been consulted, and 
their duration raises questions about whether the selLreported disabil¬ 
ity data have the same meaning when reported by (for example) a sub- 
ject who reports that he was two months out of the labor market because 


of a broken leg and another who reports that he has been out of the la~ 
bor market for five years because of a bad back. 

We leave the analysis of labor force participation with a strong case 
to be made for two points: Cognitive ability is a significant determinant 
of dropout from the labor force by healthy young men, independent of 
other plausibly important variables. And the group of men who are out 
of the labor force because of selLdescribed physical disability tend uv 
ward low cognitive ability, independent of the physical demands of their 
work. 

UNEMPLOYMENT 

Men who are out of the labor force are in one way or another unavaib 
able for work; unemployed men, in contrast, want work but cannot find 
it. The distinction is important. The nation’s unemployment statistics 
are calculated on the basis of people who are looking for work, not on 
those who are out of the labor force. Being unemployed is transitory, a 
way Station on the road to finding a job or dropping out of the work 
force. But it is hard to see much difference between unemployment and 
dropping out in the relationship with intelligence. We begin with the 
basic breakdown, set out in the following table. The extremes—Classes 
I and V—differed markedly in the frequency of unemployment lasting 
a month or more, with dass V experiencing six times the unemploy' 
ment of Class 1. dass IV also had higher unemployment than the up- 
per three-quarters of the IQ distribution. 


Which White Young Men Spent a Month 
or More Unemployed in 1989? 

Cognitive Class 

Percentage 

1 Very bright 

2 

II Bright 

7 

III Normal 

7 

IV Dull 

10 

V Very dull 

12 

Overall average 

7 
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Socibeconormc Background Versus Cognitive Ability 

The independent roles of our three basic variables are shown in the fig- 
ure helow. For a man of average age and socioeconomic Background, 
cognitive ability lowered the probability of being unemployed for a 
month from 15 percent for a man at the 2d centile of IQ to 4 percent 
for men at the 98th centile. Neither parental SES nor age had an ap- 
preciable (or statistically significant) independent effect, 

The Role of Education 

Before looking at the numbers, we would have guessed that cogni¬ 
tive ability would be more important for explaining unemployment 
among the high school sample than among the College sample. The 
logic is straightforward: A College degree supplies a credential and 
sometimes specific job skills that, combined with the College gradu- 


High IQ lowers the probability of a monthdong spell 
of unemployment among white men, while 
socioeconomic background has no effect 


Probability of being unemployed for a month or more 

16 % - 



0 %—i- \ -*-1-1 r~ 

Very low Very high 

(-2SDO (+2 SDs) 

Note: For computing the plor, age and either SES (for the black curve) or IQ (lor the gray 
curve) were set at their mean valucs. 


aie’s greater average ievel of intelligence, should reduce the inde¬ 
pendent role of IQ in ways that would not apply as strongly to high 
school graduates. 1 ^ 1 But this logic is not bornc our by the NLSY. 
Cognitive ability was more important in deterinining unemployment 
among College graduates rhan among the high school sample, although 
the small sample sizes in this analysis muke this conclusion only 
tentative. Socioeconomic background and age were not indepen¬ 
dent ly important in explaining unemployment in the high school or 
College samples. 

A CONCLUSION AND A REMINDER ABOUT INTERPRETING 
RARE EVENTS 

The most basic implication of the analysis is that intelligence and its 
eorrelaies—maturity, farsightedness, and personal competence—are 
important in keeping a person employed and in the labor force. Because 
such quäl i lies are not ent i re ly governed by economic conditions, the 
quesrion of who is working and who is not cannot be answered just in 
terms of what johs are available. 

This Joes not mean we reject the relevance of structural or economic 
conditions. In had economic times, we assume, finding a job is harder 
tor the mature and farsighted as weil as for rhe immature and the short- 
sighted, and it is easier to get discouraged and drop the search. Our goal 
is to add some leavening to the usual fonnulat ion. The state of the econ- 
omy matters, but so do personal qualities, a point that most economists 
would prohahly accept if it were brought to their attention so haldly, hut 
somehow it gets lett out of virtually all discussions of unemployment 
and labor force participation. 

As we close this discussion of cognitive ability and labor force be¬ 
havior, let us he clear about what has and has not been demonstrated. 
ln foeusing on those who did drop out of the labor force and thosc 
who were unemployed* we do not want to forget that most white males 
at every level of cognitive ability were in the labor force and working, 
even at the lowest cognirive levels. Among physically able white males 
in Class V, the bottom 5 percent of the IQ distribution, comprising 
men who are intellectually borderline or clinically retarded, seven 
out of ten were in the labor force for all fifty-two weeks of 1989. 
Of those who were in the labor force throughout the year, more than 
eight out of ten experienced not a single week of unemployment. 
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Condescension toward these men is not in order, nor are glib as- 
sumptions that those who are cognitively disadvantaged cannot be 
productive citizens. The world is statistically tougher for them than 
for others who are more fortunate, but most of them are overcoming 
the odds. 


Chapter 8 
Family Matters 


Rum«>r.s of the death of the traditionell family have rauch truth in them for some 
jxirts of white American society —those with low cognitive ability and little ed- 
ucation—and much less truth for the College educated and very bright Amor- 
icans oj all educational levels. In this instance, cognitive ability and education 
appear to play mutually rein/orcing but also independent rotes. 

For marriage, the general rule is that the more intelligent get married. at 
higher rates than the less intelligent. This relationship, which applies across the 
ränge of intelligence t is obscured among peuple with high levels of education 
because College and graduate schont are powerful delayers of marriage . 

Divorce has lang been more [rrevalent in the lower socioeconornic and ed¬ 
ucational hrackets , but this tinns out to be expUiined better by cognitive level 
than by social Status . Once the marriage - break mg impact of low intelligence 
is taken intu accoimt, people of higher socioeconornic Status are more likely to 
get divorced than people of loiver Status. 

Illegitimacy, one of the central social Problems of the cimes, is strongly re¬ 
lated to intelligence. White women in the boturn 5 percent of the. cognitive 
ability distribution are six times as likely to have an illegitimate first child as 
those in the top 5 percent. One out of five of the legitimate first bahies of 
women in the bottom 5 percent was conceived prior to marriage, cornpared 
to fewer than one out of twenty of the legitimate babies to warnen in the top 
5 percent. Even among young women who have grown up in broken homes 
and among young women who are poor—both of which foster illegitimacy — 
low cognitive ability further raises the odds of giving birth illegitirnately. Low 
cognitive ability is a much stronger predisposing factor for illegitimacy than 
low socioeconornic background. 

At lower educational levels, a womans intelligence best predicts whether 
she will bear an illegitimate child. Toward the higher reaches of education, al¬ 
most no white women are having illegitimate children , whatever their family 
background or intelligence. 
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T he conventional understanding of troubles in the American fam- 
ily has several story lines. The happily married couple where the 
husband works and the wife stays home with the children is said to be 
as outmoded as the hüstle. Large proportions of young people are stay- 
ing single. Half the marriages end in divorce. Out-of-wedlock births are 
soaring. 

These features of modern families are usually discussed in the media 
(and often in academic presentations) as if they were spread more or less 
evenly across society. 111 In this chapter, we introduce greater discrimi- 
nation into rhat description. Unquestionably, the late twenrieth Cen¬ 
tury has seen profound changes in the structure of the tarn ily. But it is 
easy to misperceive what is going on. The differences across socioeco- 
nomic classes are large, and they reflect important differences by cog- 
nitive dass as well. 

MARR1AGE 

Marriage is a fundamental building block of social life and society it seif 
and thus is a good place to starr, because this is one area where much 
has changed and little has changed, depending on the vantage point 
one rakes. 

From a demographic perspective, the changes are huge, as shown in 
the next figure. The marriage rate since the 1920s has been volatile, but 
the valleys and peaks in the figure have explanations that do not nee- 
essarily involve the underlying propensiry to marry. The Great Depres¬ 
sion probably had a lot to do with the valley in the early 1930s, and 
World War II not only had a lot to do with the spike in the late 1940s 
but may well have had reverbe rat ions on the marriage rate that lasted 
into the 1950s. Itcouldevenbeargued that once these disruptive evenls 
are taken into account, the underlying propensity to marry did not 
change from 1930 to the early 1970s. The one prolonged decline for 
which there is no obvious explanation except a change in the propen¬ 
sity to marry began in 1973, when marriage rates per 1,000 women be- 
gan dropping and have been dropping ever since, in good years and bad. 
ln 1987, the nation passed a landmark: Marriage rates hit an all-time 
low, dropping below the previous mark set in the depths of the depres- 
ston. A new record was promptly set again in 1988. 

This change, apparently reflecting some bedrock shifts in attitudes 
toward marriage in postindustrial societies, may have profound signifi- 


ln the early 1970s, the marriage rate began a 
prolonged decline for no immediately apparent reason 


Marriages per 1,000 women 
120 - 
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Souvlvs: U.S. Bureau of the Census, 1975, TaMe B2 14- 2 15; SAttS, 1992, TuMe 127, and 
comparahle fahles in vurious edirions, 

cance. And yet marriage is still alive and well in the sense that it re- 
mains a hugely populär Institution. Over 90 pereent of Americans of 
both sexes have married by the time they reach their 40s." 

Marriage and IQ 

What does cognitive ability have to do with marriage, and is there any 
reason to think that it could he interacting with society’s declining 
propensity to marry? 

We know from work by Robert Rutherford thar in premodern soci¬ 
eties the wealthy and successful married at younger ages than the poor 
and underprivileged/ Retherford further notes that inte 11 igenee and so¬ 
cial Status are correlated wherever they have been examined; hence, we 
can assume that intelligence—via social Status—faeilitared marriage in 
premodern societies. 

With the ad vent of modernity, however, this rclationship flips over. 
Throughout the West since the nineteenth Century, people in the more 
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privileged sector ot society have married later and at lower rates rhan 
che less privileged. We examine the demographic implications of this 
phenomenon in Chapter 15. For now, the implieation is that in late- 
twermeth-century America, we should expect to find lower marriage 
rates among the highly intelligent in the NLSY. 

Everyday experience bears out this finding for people who live in aca- 
demic communities or Professional circles, where they see many smart 
men and women in their 30s and 40s who are still single and look as if 
they might stay that way forever. The intelligent Professional woman is 
the most visible of this new tribe, rising in her career, too husy for, or 
not interested in, marriage and children. Among men, other images 
have recently become part of the eulture: the intelligent, successful, and 
unmarried heterosexual male who cannot make a commirment and the 
intelligent, successful, and unmarried homosexual male who no longer 
needs to go through the motions of a marriage. 

Ar the other end of the scale, the re are similar reasons in research 
and common sense to suggest that marriage rates will tend to he low 
among people at the very bortom of the IQ distribution/ For a numher 
of reasons, having to do with everything from initiative to romance to 
economics, people with very low IQs are likely to be at a disadvantage 
in cornpeting for marriage partners. 

Our first look at the NLSY data conforms to these expectations, 
though not dramatically. The nexr table shows the Situation for the 
NLSY sample among whites who had reached the age of 30. The re were 
surprises in these results for us, and perhaps for so me of our readers. We 
would not have guessed that the average age of marriage for people in 
the top 5 percent of the Intelligence distribution was only 25, for ex- 
arnple. 1 ^ 1 A main point of the table is to introduce the therrte threaded 
throughout. the chapter; Our, your, and the media’s impressions of the 
state of the American family are not necessarily accurate. 

The Role of Socioeconomic Background 

Note in the tahle below that marriage percentages are highest for peo- 
ple in the middle of the intelligence distribution and taper off on borh 
ends. The same is true, though less dramatically, if the table is coiv 
structed by socioeconomic dass: The percentage of whites who had mar- 
ried before rhe age of 30 declines at hoth extremes. Furthermore, we 
have good reasons for thinking that this pattem is not a sampling finke 


Which Whites Get Married When? 


Percentage who 


Had Ever Married 
Before Age 30 

Cognitive Class 

Average Age at 
First Marriage 

67 

I Very brighr 

25.4 

72 

II Bright 

24.3 

81 

111 Normal 

22» 

81 

IV Du 11 

21.5 

72 

V Very dull 

21.3 

78 

Overall averages 

22.1 


hur reflects underlying dynamics of marriage. This pattem makes inter- 
preting regression results tricky, because the regression techniques we 
are using compute the lines in the graphs hased on the assumption that 
rhe lines are not trying to make U-turns. For the record: When we run 
the Standard initial analysis incorporating IQ, age, and socioeconomic 
Status as predictors of marriage, IQ has no significant independent role; 
th ere is a slight, Statistically insignificant downward probability of mar¬ 
riage as IQ goes up. Socioeconomic background has a much 1 arger siip- 
pressive role on marriage: The richer and beiter educated your parenrs, 
rhe less likely you are to marry, acc.ording to these results, which, again, 
must be interpreted cautiously. 

The Role oj Education 

The real culprit in explaining marriage rates in a young population is 
education. ln the rest of rhe chapters of Part II, we point out many in¬ 
stantes in which taking education into account does not much affect 
IQs independent role. Not so with marriage. When we take education 
into account, the apparent relationship reverses: The prohahility of mar- 
rying goes up, not down, for people with high IQs—a result found in 
other datahases as well. 161 Our Standard analysis with the two educational 
samples, high school graduates (no more and no less) and College grad- 
uates (no more and no less) elucidates this finding. 

The figure shows that neither IQ nor socioeconomic background was 
important in determining marriage for the College sample. In sharp con- 
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High IQ raises the probability of marriage for the white high school 
sample, while high socioeconomic background lowers it 

Probability of marriage by age 30 



Very low Very high 

(-2 SDs) ( + 2 SOst 


Note: For compuring the plot, age, and either SES (for the black curves) or IQ (hu the gray 
curves) were set ar their mean values. 


trast, IQ made a significant difterence in the high school sample. A high 
school graduate from an average socioeconomic background who was at 
the hottom of the IQ distribution (2 Standard deviations below the 
mean) had a 60 percent chance of having married. A high school 
graduate at the top of the IQ distribution had an 89 percent chance of 
having married. Meanwhile, the independent role of socioeconomic 
Status in the high school sample was either slightly negative or nil (the 
downward slope is not statistically significant). 

DIVORCE 

People marry, but do they stay married? Here is where the change has 
been not only dramatic but, some would say, cataclysmic, as shown he' 
low. In 1920, only death parted husbands and wives in about 82 percent 
of marriages and, in any given year (the datum shown in the next 
figure below), only about 8 out of 1,000 married females experienced a 
divorce. As late as 1964, despite the sweeping changes in technology, 
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The divorce revolution 

Divorces per 1,000 women 
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M 



1920 1930 1940 1950 1960 1970 1980 1990 

Snwnv.s; LJ.S. Bureau of the Census, 1975, Tahle B214-2J5; SAUS, 1992, Table 127, and 
comparablc rable in variotis edirions. 


wealth, and social life that had occurred in the intervening forty-four 
years, the numher was very little changed: 10 of every 1,000. The peak 
divorce rates just following World War 11 had fully suhsidcd, and the di¬ 
vorce rate still lay upon a trendline established herween 1920 and 1940. 

Thon came the revolution. The steep upward sweep of the divorce 
rate from the mid- 1960s through the end of the 1970s represents one of 
the most rapid, compressed changes in a Basic social behavior that the 
twentieth Century has witnessed. When the divorce rate hit its peak at 
the end of the 1970s, a marriage had more than a fifty-fifty chance of 
ending in divorce.' Despire a downward trend since 1980, divorce re- 
mains at twice the annual rate of the mid-1960s. 

Divorce and IQ 

We do not attempt to explain this profound change in our lives, which 
no doubt has roors in changing rnores, changing laws, changing roles of 
women, changing labor markets, and who knows what eise. Instead, we 
address the narrow question: How does divorce currently correlate with 
intelligente? 
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There are plausible reasons for expecting that cognitive ability will 
have an impacr on divorce. For example, one may hypothesize rhat: 
bright people less often marry on a whim, hence they have fewer disas- 
rrous short marriages. Bright people are perhaps less likely to act on im- 
pulse when the marriage has problems, hence are less likely to divorce 
precipitously du ring the first years of marriage. More general ly, it may he 
argued that brighrer people are her rer able to work out differences that 
might otherwise eventually destroy a marriage. We are, ot course, refer- 
ring to Statistical tendencies for which individual exceptions abound. 

Within the confines of the NLSY experience, these expectations are 
borne out to some degree, as shown in the tahle. The resuhs are based 


Which Whites Get Divorced When? 

Cognitive Class 

Percentage Divorced in First 
Five Years of Marriage 

1 Very bright 

9 

11 Bright 

15 

111 Normal 

23 

IV Hüll 

22 

V Very Juli 

21 

Ovetall averajjes 

20 


on the first five years of marriage. Those in Class 1 were ten titnes as 
likely to sray married for at least five years as to ger divorced; for those 
in C lasses III, IV, and V—the hottom three-quartersof the population— 
the ratio of marital survival to divorce for at least five years was only T5 
to 1 . |lS) Virtually all of the effeet of IQ seems to have been concentrated 
at the top of the distribution. The divorce rates across the hottom three- 
quarters of the cognitive ability distribution were essentially identical. 

The Role of Socioeconomic Background 

Do these findings hold up when we begin to add in other considera- 
tions?The figure below shovvs the results for the white sample who had 
been married at least five years. 191 The consistent finding, represented 
fairly by the figure, was that higher IQ was still associated with a lower 
probability of divorce after extracting the effeets of other variables, and 
parental SES had a significant positive relationship to divorce—that is, 
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IQ and socioeconomic background have opposite effects on the 
likelihood of an early divorce among young whites 

Probability of divorce in the first five years of marriage 
40 %- 

As IQ goes fwm low to high 

MWt - 
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from low to high 
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Nore: ln aJdinon to IQ, i»ßt\ und parental SES, rhe independent variables ineluded dato of 
first marriage. For computing rite plot, age, dare ot first marriage, and eiiher SES (for the 
hlaek eurve) or IQ (for ihe gray curve) were ser ar their mean values. 


IQ heing equal, children of higher-status families were more likely to 
get divorced than children of lower-stat.us families. 1101 

The Role of Education 

lt is clear ro all researchers who examine the data that higher education 
is associated with lower levels ot divorce. This was certainly true of the 
NLSY, where the College sample (persons with a bachelor’s degree, no 
more and no less) had a divorce rate in the first five years ot marriage 
that was less than half that of the high school sample: 7 percent com- 
pared to 19 percent. But this raw outcome is deceptive. 11 Holding some 
critical other things equal—IQ, socioeconomic Status, age, and date of 
marriage—rhe divorce rate for the high school graduates in the first five 
years of marriage was lower than for College graduates. 
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For whom did IQ make more difference: the high school sample or 
the College sample? The answer is the College sample, by far. For them, 
the probahility of divorce in the first five years plunged from 28 percent 
for someone with an IQ of 100 to 9 percent for someone wich an IQ of 
130. The much more minor effect. of IQ among high school graduates 
was not Statistical Ly significant. |ul 

Do Broken Families Beget Broken Families? 

One other cause ot divorce is mentioned so commonly that it requires 
exploration: a broken home in the preceding generation. The children 
of divorced parents have an elevated risk themselves of getting di- 
vorcedQ It is not hard to think of reasons why: Thev have not witnessed 
hovv a successful marriage works, they are more likely to see divorce as 
an acceptahle alternative, the t urbule nee of a failing marriage leaves 
psychological scars, and so (orth. lHl 

None of these reasons has an obvious connection with cognitive ahib 
ity. They could he valid without necessarily affecting the independent 
prophylactic rote that being smart plays in preventing (or perhaps sinn 
ply delaying) divorce. And so indeed it worked out in the NLSY. Civen 
a young person of average IQ and socioeconomic background, the prolv 
ability of divorce within the first five years of marriage was lowest for 
those who ar age 14 had beeri living with both parents (20 percent), a 
bit higher for those who had been living with a remarried parent (22 
percent), and higher still for those living with an urvremarried or neven 
married mother (25 percent). 1151 These are not large effects, however, 
and are not significant in a Statistical sense. We can say only that the 
results supported the general proposition that, when it comes to raising 
children who will themselves stay married, two adults as parents are gen- 
erally better than one and that two biological parents in the household 
are better than one or none. But it is worth noting that the inrroduc- 
tion of these variables did nothing to change the importance of the rest 
of the variables. Higher cognitive ability conferred just about as much 
protection from, and higher Status just as much risk for, divorce as in the 
preceding analyses. 

The NLSY gives us a window on the early years of marriage, though not 
necessarily about marriage as a whole. Based on national divorce rates, 
we know that most of the divorces that the members of the NLSY will 


experience have yet to occur. We will have to wait and see what hap- 
pens to the NLSY sample in later years. 

One final point about the divorce results is worth noting, however. 
These findings may help explain the common observation that divorce 
is less likely when the husband has high education, income, or socio- 
economic Status or that marriages are more likely to fall apart if they 
Start when the couple is afflicted with unemployment. 16 If we had 
showed a breakdown of divorce rates in the NLSY by social and eco¬ 
nomic measures alone, we roo would have shown such effects. But each 
of those variables is correlated with cognitive ability, and the studies 
that examine them almost never include an independent measure of in¬ 
te 11 igence per se. Some portion of what has so often been observed about 
the risk factors for divorce turns out to he more narrowly the result of 
low cognitive ability. 

ILLEG1TIMACY 

Childhearing touches on one of the most sensitive topics in the study 
of intelligence and its social consequences: fertility patterns among the 
smart and the dumb, and their possiblc long-term effects on the Intel- 
lectual Capital of a nation’s population. We devote a full chapter to this 
topic (Chapter 15) in the portion of the hook dealing with the national, 
multiracial perspective. In this chapter, the focus is on family problems, 
and one of the leading current problems is the failure of two-parent fam- 
ilies to form in the first place, as denoted by hirths to single women— 
illegitimacy. 

We use the older term “illegitimacy” in favor of the phrases currently 
in favor, “out-of-wedlock hirths” or “births to single women,” because 
we think that, in the long run, the word illegitimacy will prove to he 
the right one. We are instructed in this by the anthropologist Bronis- 
law Malinowski. ln his research early in the Century, Malinowski oh- 
served a constant running throughout the rieh diversity of human 
eultures and indeed throughout history. He decided that this amounted 
to “a universal sociological law” and called it the “principle of legiti- 
macy.” Nt) matter what the culture might be, “there runs the rule that 
the father is indispensable for the full sociological Status of the child as 
well as of the mother, that the group consisting of a woinan and her off¬ 
spring is sociülogically incomplete and illegitimate.” 17 The rule applied 
alike to East or West, primitive eultures or advanced ones, eultures 
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where premarital sex was accepted or banned, where children were con- 
sidered an asser or a bürden, where fathers could have one wife or many. 

Despite. our faith that Malinowski was observing something that will 
once again be considered true about human societies, ehe Contemporary 
Western democracies, including the United States, seem intent on 
proving Malinowski wrong, as shown in the nexr figure. 


The illegitimacy revolution 
Percentage of children bom out of wedlock 
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Source: Various eJiuons of che NhiUility volume of Viral Swri.sriL.s, eompiled annuallv hy the 
Public Health Service. 


ln the seventy-one years from 1920 to 1990, the proportion of chih 
dren horn to single women in the United States went from less than 3 
percent, roughly where it had been throughout American history, to 30 
percent. Elh| It would have been about 6 percent had the trendline es- 
tablished from 1920 to 1952 remained unchanged. The trendline shitted 
upward during the 1950s, hut not dramatically. If we had maintained 
the trendline established from 1952 to 1963, the United States would 
have had about 11 percent of births out ot wedlock in 1991. lnstead, the 
figure was 30 percent, the result of a steep, sustained increase thar gath- 


ered steam in the mid-1960s and continued into the early 1990s. The 
increase for the most recent available year, 1991, was one of the largest 
in history. There are no signs as we write that illegitimacy is reaching 
an asymptote. 

Anyone who is trying to understand social trends must also realize 
that the magic of compound interest has created an even rnore explo¬ 
sive rise in the population of unmarried mothers and children. ln 1960, 
for example, there were just 73,000 never-married mothers between the 
agesof 18 and 34. In 1980, there were 1.0 million. 110 * In 1990, there were 
approxinvately 2.9 millionJ 2ü) Thus the illegitimacy ratio increased by 
sixfold from 1960 to 1990—had enough—but the number of never- 
married mothers increased fortyfold. From just 1980 to 1990, while the 
illegitimacy ratio was increasing by half, the number of unmarried moth¬ 
ers almost tripled. 

Illegitimacy and IQ 

lf IQ is a factor in illegitimacy, as we will conclude it is, it must be in 
combination with other things (as common sense would suggest), be- 
cause IQ itself has not changed nearly enough in recent years to account 
for the explosive growth in illegitimacy.' 211 But we will also he explor- 
ing the possihility that some of these “other things” that have changed 
in the last three decades—broken homes and the welfare System being 
prime suspects—interact with Intelligence, making it still more likely 
than betöre that a wornan of low cognitive ability will have a baby out 
of wedlock. 

Among other reasons that cognitive ability may be related to illegit¬ 
imacy, we have this causal model in mind: The smarter a woman is, the 
more likely that she deliberately decides to have a child and calculates 
the best time to do it. The less intelligent the woman is, the more likely 
that she does not think ahead from sex to procreation, does not re- 
member to use birth control, does not carefully consider when and an¬ 
der what circumstances she should have a child. How intelligent a 
woman is may interner with her impulsiveness, and hence her ability to 
exert self-discipline and restraint on her partner in Order to avoid preg- 
nancy. The result is a direct and strong relationship between high in- 
telligence and the likelihood that a child is conceived after marriage, 
and between low intelligence and the likelihood that the child will be 
bom out of wedlock. 
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There are, of course, ohjections to this explanarion. Some will hris- 
tle at our identification of conception within marriage with the intelli¬ 
gent thing to do. But is it real ly controversial or even arguahle? Und er 
what circumstances can a thoughtful, coolheaded appraisal lead one ro 
conclude that it is hetter to conceive a child outside marriage? 1t such 
circumstances exist, are they not exceptional? Perhaps a woman vvants 
to conceive a child out ot marriage, but how likely is it that a disinter- 
ested person would consider it to be in the best interest ot all eoncerned, 
including the child’s? 

We begin our exploration with the overall numhers. First, how many 
white wo men are e rigaging in this behavior? As the next table shows, 
the differences among the cognirive classes are extremely large. Only 2 
percent ot white women in Class 1 had given hirth to an illegitimate 
child as ot the 1990 interview, eompared to 32 percent of the women in 
Class V. 


The lncidence of Illegitimacy Among 
Young White Women 

Cognitive Class 

Percentage Who 

Have Given Birth to an 
Illegitimate Baby 

I Very bright 

2 

11 Bright 

4 

111 Normal 

8 

IV Du 11 

17 

V Very dull 

U 

Overall average 

8 


Now we switch lenses. Inste ad of asking how many women ha ve ever 
had an illegitimate baby, we ask what proportion of first hahies horn to 
white women are illegitimate. The next table shows the results. The 
proportions of illegitimate first hirths in the top two cognitive classes 
are nearly the same, rounding to 7 percent—ahout half the proportion 
for Class III, a third of the proportion for Class IV, and a sixth of the 
proportion for Class V. lllegitimacy is again conspicuously concentrated 
in the lowest cognitive groups. 


The Proportion of White First Births 
That Are Illegitimate 


Cognitive Class 

I Very hright 

II Bright 

III Normal 

IV Hüll 

V Verydull 
Overall average 


Percentage of 
Illegitimate Births 

7 

7 

1J 
21 
42 
14 


The relationship between intelligence and illegitimacy is strong not 
only in these hasic respects, but also in more subtle ways, as the num¬ 
hers hased on the women s first hirths, shown in the next table, reveal. 


Circumstances of the First Birth Among Whites 


Born Illegitimate Born After Marriage 


Mother 

Mother 


Conceived 

Conceived 

Hasn’t 

Eventually 

Cognitive 

Bef o re 

After 

Married“ 

Married 1 ' 

Class 

Marriage 

Marriage 

3% 

4% 

1 Very bright 

4% 

89% 


4 

II Bright 

i.3 

80 


10 

111 Normal 

20 

67 

7 

16 

IV Dull 

22 

55 

17 

24 

V Very Juli 

12 

47 

4 

10 

Population averages 19 

68 


■' By ihe tum' ot the 1990 interview. 


Not only are ehildren of mothers in the top quartile of intelligence 
(Classes 1 and II) more likely to he horn within marriage, they are more 
likely to have been conceived within marriage (no shotgun wedding). 
The differences among the cognitive classes are large, as il they lived in 
different worlds. For the women in Class V, only 47 percent of the first 
ehildren were conceived after a marriage ceremony; for the wonien in 
(dass I, 89 percent. 

The tahle makes a strong prima facie case for a relationship between 
cognitive ability and illegitimacy. The question is whether it survives 
serutiny when we introduce other factors into the analysis. 1 ^ 1 
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The Role of Socioeconomic Background 

The socioeconomic background of a young woman was traditionally 
rhought ro he crucial in determining whether she höre a child out ot 
wedlock. The old-fashioned view of illegitimacy was that it occurred 
rnostly among girls from the lower classes, with occasional and scatv 
dalous slip'ups by higher-class “good girls” who “got in trouhle.” Bur dur- 
ing the last few decades, as births outside marriage became more 
common and as examples proliferated of film scars and career women 
who were choosing to have bahies wirhout husbands, an alternative 
view spread. The sexual revolution had obviously penetrated tu all lev- 
eis of society, it was argued, and births out ot wedlock were occurring at 
all levels of our sexually liberated society. 

The re were never any systematic data to support this view, but nei- 
ther did scholars rush to check it out. A 1980 article in the American 
Sociological Review on education and fertility reported that white women 
with less than a high school education were twenty times more likely 
to have a child out of wedlock than white women with at least a Col¬ 
lege degree, but illegitimacy was only a side issue in the arricle and the 
datum never got noticed in the public dialogue.“ 3 The relationship of 
teenage illegitimacy to social and cognitive factors was first treated in 
detail in an analysis of the High School and Reyond survey puhlished 
hy the RAND Corporation in 1988. 1241 The report revealed that more 
than threc-quarters of the teenage girls in this national sample who had 
habies while they were still of high school age came from families in the 
hottom half of the socioeconomic Stratum. More than half came from 
the bottom quartile. This finding also held true among just the white 
teenage girls who had bahies out of wedlock, with 70 percenr coming 
from the hottom halt of the socioeconomic dis tri but ion and only 12 per- 
cent from the top quartile. 12 ^ 1 The RAND study was also the first to re- 
veal that cognitive ability played an important role, independent of 
socioeconomic Status. 12h| 

The data from the NL8Y generally confirm those reported in the 
RAND analysis. On the surface, white illegitimacy is associated with 
socioeconomic Status: About 9 percent of habies of women who come 
from the upper socioeconomic quartile are illegitimate, comparcd to 
about 23 percent of the children of women who come from the hottom 
socioeconomic quartile. But white women of varying Status backgrounds 


differ in cognitive ability as well. Our Standard analysis with IQ, age, 
and parental SES as independent variables helps to clarify the Situation. 
The dependent variable is whether the first child was born out of wed- 
lock.™ 


IQ has a large effect on white illegitimate births independent 
of the mother’s socioeconomic background 


Probability of an illegitimate first birth 
40% - 



f-2 SDs) (+2 SDs) 

Note: Für computing the plnt, age and eicher SES (for the black curve) or IQ (for rhe gray 
curvc) were ser at rHeir mean values. 


Higher social Status reduces the chances of an illegitimate first hahy 
from about 19 percent for a woman who came from a very low Status 
family to about 8 percent for a woman from a very high Status family, 
given that the woman has average intelligence. Let us eompare that 11 
percentage point swing with the effect of an equivalent shift in intelli¬ 
gence (given average socioeconomic hackground). |2S] The odds of hav- 
ing an illegitimate first child drop {vom 34 percent for a very dull woman 
to ahout 4 percent for a very smart woman, a swing of 30 percentage 
points independent of any effect of socioeconomic Status. 
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The Rote of Educatian 

Without doubt, the number of wcll-educated women who are deliher- 
ately deciding to have a bahy out of wedlock—the name “Murphy 
Brown*’ comes to mind—has increased. The Bureau of the Census s most 
recent study of fertility of American women revealed that the percent- 
age of never-married women with a bachelors degree who had a baby 
had increased from 3 to 6 percent from 1982 to 1992 Rur during the 
same decade, the percentage of never-married women wirh less than a 
high school education who had a baby increased from 35 to 48 percent. k 
The role of education continues to be large. 

ln the NLSY, the statisrics contrasr even more starkly. Among white 
women in the NLSY who had a bachelors degree (no more, no less) and 
who had given birth toa child, 99 percent ot the habies were born within 
marriage. ln other words, there is virtually nt) independent role for IQ 
to play among women in the College sample. 1t is true that the women 
in that 1 percent who gave birth out of wedlock were more likely to have 
the lower test scores—independent ot any effect of their socioeconomic 
backgrounds—but rhis is of theoretical interest only. 

Meanwhile, for white women in the NLSY who had a high school 
diploma (no more, no less) and had given birth to a child, 13 percent 
of the children had been born out of wedlock (compared to 1 per- 
cenr for the College sample). For them, the independent role of IQ 
was as large as the one for the entire population (as shown in the 
preceding figure). A high school graduate with an IQ of 70 had a 34 
percent probability that the firsr bahy would be born out of wedlock; 
a high school graduate wirh an IQ of 130 had less than a 3 percent 
chance, after extracting the effects of age and socioeconomic back' 
ground. The independent effect of socioeconomic Status was com- 
paratively minor. 

The Role of Broken Haines 

We have already noted that family stnicture at the age of 14 had only 
modest influence on the chances of getting divorced in the NLSY sam¬ 
ple after controlling for IQ and parental SES. Now the question is hovv 
the same characteristic affects illegitimacy. Let us consider a white 
woman of average intelligence and average socioeconomic Background. 
The odds that her first child would he born out of wedlock were: 


10 percent if she was living with both biological parents. 

18 percent if she was living with a biological parent and a steppar- 
ent. 

25 percent if she was living with her mother (with or without a live¬ 
in Boyfriend). 

The difference between coming from a traditional family versus any- 
thing eise was large, with the stepfamily about halfway between the tra- 
ditional family and the mother-only family. 

As we examined the role of family structure with different break- 
downs (the permutations of arrangemenrs that can exist are numer- 
ous), a few patterns kept recurring. 1t seeined that girls who were still 
living with their biological father ar age 14 were protected from hav- 
ing their first baby out of wedlock. The girls who had been living with 
neither biological parent (usually living with adopted parents) were 
also protected. The worst outcomes seemed conspicuously associated 
with situations in which the M^yeur-old had been living with the 
biological mother bur not the biological father. Here is one such break- 
down. The odds that a white woman’s first baby would be born out 
of wedlock (again assuming average intelligence and socioeconomic 
Background) were; 

8 percent if the biological mother, but not the biological father, was 
absent by age 14. 

8 percent if both biological parents were absent at age 14 (mostly 
adopted children). 

10 percent if both biological parents were present at age 14- 

23 percent if the biological father was absent by age 14 but not the 
biological mother. 

There is considerable food for thought here, but we refrain from spec- 
ulation. The main point for our purposes is that family structure is clearly 
important as a cause of illegitimacy in the next generation. 

Did cognitive ability still continue to play an independent role? Yes, 
for all the different family configurations that we examined. Indeed, the 
independent effect of IQ was sometimes augmented by taking family 
structure into account. Consider the case of a young woman at risk, hav- 
ing lived with an unmarried biological mother at age 14- Given aver- 
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age socioeconomic hackground and an average IQ, the probahility that 
her first baby would be born out of wedlock was 25 percent. lt she had 
an IQ at the 98th centile (an IQ of 130 or above), the probabiliiy 
plunged to 8 percent. If she had an IQ at the 2d centile (an IQ of 70 or 
below), the probability soared to 55 percent. High socioeconomic Sta¬ 
tus offered weak protection againsr illegitimacy once IQ had heen taken 
into accountQ” 

The Role of Poverty and Welfare 

ln the next chapter, we discuss IQ in relation to welfare dopende nee. 
Here, we take up a common argument about welfare as a cause of iHo- 
gitimacy. lt is not that low IQ causes women to have illegitimate hahies, 
this argument suggests, but that the combination of poverty and wel¬ 
fare causes women to have illegitimate babies. The logic is that a poor 
woman who is assured of clothes, shelter, food, and medical care will 
take fewer precautions to avoid getting pregnant, or, once pregnant, will 
put less pressure on the baby’s father to marry her, than a woman who 
is not assured of support. There are two versions of the argument. One 
sees the welfare System as hribing women to have babies; they get preg¬ 
nant so rhey can get a welfare check. The alternative, which we find 
more plausible, is that the welfare check (and the collateral goods and 
Services rhat are part of the welfare System) enable: s women to do some- 
thing rhat many young women might naturally like to do anyway: bear 
children. 

The controversy about the welfare explanation, in either the ,l en- 
ahling” or “bribe" Version, has been intense, with many issues still un- 
resolved. 1 '“ 1 Whicheve r version is employed, the reason for focusing on 
the role of poverty is obvious: For affluent young women, the welfare 
System is obvious ly irrelevant. They are restrained from having babies 
out of wedlock hy moral considerations or by fear of the social penalties 
(both of which still exist, though weakened, in middle-class circles), by 
a concern that the child have a father around the house, and beeause 
having a baby would interfere with their plans for the future. ln the poor- 
est communities, having a baby out of wedlock is no longer subject to 
social Stigma, nor do moral considerations appear to carry much vveight 
any longer; it is not irresponsible to have a child out of wedlock, the ar¬ 
gument is more likely to go, beeause a single young woman can in fact 


support the child without the help of a hushandQ And that brings the 
welfare System into the picture. For poor young women, the welfare Sys¬ 
tem is highly relevant, easing the short-term economic penalties that 
might ordinarily restrain their childbearing. H The poorer she is, the 
more attractive the welfare package is and the more likely that she will 
rhink herseif enabled to have a baby by receiving it. 

Given th is argument and given rhat poverty and low IQ are related, 
ler us ask whether the apparent relationship between IQ and illegiti- 
macy is an artifact. Poor women disproportionately have low lQs, and 
bear a disproportionate number of illegitimate hahies. Control for the 
effects of poverty, says this logic, and the relationship between IQ and 
illegitimacy will diminish. 

Let us see. First, we ask whether the initial condition is true: Is hav- 
ing hahies out of wedlock something that is done disproportionately not 
only hy women who come from low socioeconomic hackgrounds (a fact 
which we al ready have discussed), hut wo men. who are literal ly poor 
themselves when they reach childhearing age? Even more specifically, 
are they disproportionately living below the poverty line hefore the hirth ? 
We use the italics ro emphasize a distinction that we helieve öfters an 
important new perspective on single motherhood and poverty. It is one 
tlüng to say that single women with hahies are disproportionately poor, 
as we discussed in Chapter 5. That makes sense, beeause a single woman 
with a child isoften not a viable economic unit. lt isquite another thing 
to say that wo men who are al ready poor hecome mothers. Now we are 
arguing that there is something about hemg in the state of poverty it- 
seif (after hold ing the socioeconomic Status in which they were raised 
consrant) that makes having a baby wirhout a husband attractive. 

To put the question in operational terms: Among NLSY white moth¬ 
ers who were below the poverty line in the year prior to giving hirth, 
what proporrion of the hahies were born out of wedlock? The answer is 
44 percent. Among NLSY white mothers who were anywhere above the 
poverty line in the year hefore giving birth, what proportion of the ha¬ 
hies were born out of wedlock? The answer is only 6 percent. lt is a huge 
difference and makes a prima fade case for those who argue that poverty 
itself, presumably via the welfare System, is an important cause of ille¬ 
gitimacy. 

But now we turn to the rest of the hypothesis: that Controlling for 
poverty will explain away at least some of the apparent relationship be- 
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tween IQ and illegitimacy. Here is the basic analysis—Controlling for 
IQ, parental SES, and age—restricted to white women who were poor 
the year betöre the hirth of their bahies . 1 ^ 1 

Compare the graph below with the one betöre it and two points about 
white poor women and illegitimacy are vividly clear. First, the inde- 

IQ is a more powerful predictor of illegitimacy among poor white 
women than among white women as a whole 

Probability that the first child will be bom out of wedlock 
for white women already below the poverty line 
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pendent importance of intelligence is even greater for poor white 
women than tor white women as a whole. A poor white woman of av ¬ 
erage socioeconomic background and average IQ has more than a 35 
percent chance ot an illegitimate first birth. For white women in gen¬ 
eral, average socioeconomic Status and IQ resulted in less than a 15 per¬ 
cent chance. Second, among poor women, the role of socioeconomic 
background in restraining illegitimacy disappears once the role of IQ is 
taken into account. 

The results, taken literally, suggest that illegitimacy tends to rise 
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among poor women who came from higher socioeconomic background 
after IQ is taken into account. However, the sample of white women in- 
c Indes roo tew wo men who fit all of the conditio ns (below the poverty 
line, from a good socioeconomic background, with an illegitimate haby) 
to make much of this. The more conservarive interpretarion is that low 
socioeconomic background, independent of IQ and current poverty it- 
seil, does not increase the chances of giving birth out of wedlock among 

poor white women—in itself a sufficiently provocative finding for soci- 

i ■ UH 
ologists. 

Our main purpose has been to demonstrate that low intelligence is 
an important independent cause of illegitimacy, and ro do so we have 
considered the role of poverty. In reality, however, we have also opened 
up many new avenues of inquiry that we cannot fully pursue without 
writing an entire hook on this subject alone. For example, the results 
raise many questions to be asked about the “culture of poverty” argu- 
ment. To the extern that a culture of poverty is at work, transmitting 
dysfunctional values from one generation ro the next, it seerns para- 
doxical that low socioeconomic background does not fester illegitimacy 
once poverty in the year prior to birth is hrought into the picture. 

Rut the main rask posed by these results is to tili in the reason for that 
exrremely strong relationship between low IQ and illegitimacy within 
the population of poor white women. The possibilities bear directly on 
sorne oi the core issues in the social policy dehate. For example, many 
people have argued that the weif are System cannot real ly be a cause of 
illegitimacy, hecause, in objective terms, the weif are System is a had deal. 
It provides only enough to squeak by, it can easily trap young women 
into long-term dcpendence, and even poor young women would be 
much better off by complering their education and getting a job rather 
than having a baby and going on welfare. The results we have presented 
can be interpreted as saying that the welfare System may be a had deal, 
hut it takes foresight and intelligence to understand why. For women 
without foresight and intelligence, it may seem to be a good deal. He nee 
poor young women who are bright tend not to have illegitimate bahies 
nearly as often as poor young women who are duü. 

Another possibility fits in with rbose who argue that the best pre- 
venrative for illegitimacy Ls better opportunities. It is not the welfare 
System that is at fault but rhe lack of other avenues. Poor young women 
who are bright are getting scholarships, or othervvise having positive in- 
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centives offered to them, and they aecordingly defer childhearing. Poor 
yonng women who are dull do not get such opporrunities; they have 
nothing eise to do, and so have a baby. The goal should be to provide 
them too with other ways of seeing their futures. 

Both ot these explanations are stated as hypotheses that we hope oth- 
ers will explore. Those explorations will have to ineorporate our cen¬ 
tral finding, however: Cognitive ability in itself is an important tactor 
in illegitimacy, and the dynamics for understanding illegitimacy—and 
dealing with it through policy—must take that strong link into account. 

THE SELECTIVE DETERIORATION OF THE TRADITIONAL 
FAMILY 

Our goal has been to sharpen understanding of the muchdamented 
breakdown of the American family. The American family has been as 
battered in the latter decades ot the twentieth Century as the public 
rhetoric would. have it, hut the damage as measured in terms ot divorce 
and illegitimacy has been tar more selective than we hear. By way ot 
summary, lex us consider the children ot the white NLSY mothers in the 
top quartile ot cognitive ability (('lasses l and II) versus those in the 
bottom quartile (Classes IV and V): 

• The percentage of households wich children that amsist of a manried cou- 
ple: 87 percent in the top quartile of IQ, 70 percenc in the bottom 
quartile. 

• The percentage of households with children that have cxperienced di- 
vorce: 17 percent in the top quartile ot IQ, 3 3 percent in the bot¬ 
tom quartile. 

• The percentage of children hom out of wedlock: 5 percent in the top 
quartile of IQ, 2 3 percent in the bottom quartile. 

The American family may be generally ander siege, as people often 
say. But it is at the bottom of the cognitive ability distribution that tts 
defenses are most visibly crumbling. 


Chapter 9 

Welfare Dependency 

People have had reason to assume for many years that welfare mothers are 
concentratedat the low end ofthe cognitive ability distribution, if only be cause 
they have generally done poorly in school. Beyond that, it makes sense that 
smarter women can more easily find Jobs and resist the temptations of welfare 
dependency than duller ones, even if they have giieri birth out of wedlock . 

The link is amfirmed in the NLSY. Over three-quarters of the white women 
who teere an welfare within a year of the. birth of their first child came from 
the bottom quartile of IQ, compared to5 percent from the top quartile. When 
tee subdivide welfare recipients into tu>o groups, “ temporary ” and “chronic,” 
the link persiscs, ihough differently for the two groups. 

Among warnen who received welfare temporarily , low IQ is a powerful 
risk factor even after the effects of marital Status, poverty, age, and socioecu- 
nomic background are statistically extracted. For chronic welfare recipiency, 
the story is more complicated. For practical purposes, white women with 
above-average cognitive ability or above-average socioeconomic background 
do not becomc chronic welfare recipients. Among the rcstricted sample oflow- 
1Q, low-SES, and relatively uneducated white women who are chrorücally 
on welfare, low socioeconomic background is a more powerful predictor than 
low IQ, even after taking account of whcther they teere themselves below the 
poverty line at the time they had their babies. 

The analyses Inovide some support for those teho argue that a culture of 
poverty tends to transmit chronic welfare dependency from one generation to 
the next. But if a culture of poverty is at work, it seerns to have in/Iuence pri- 
marily among women who are of low intelligence. 


A part from whether it causes increased illegitimacy, welfare has been 
a prickly topic in the social policy debate since shortly after the core 
welfare program, Aid to Families with Dependent Children (AFL9C), 
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was created in the midT9 30s. Originally AFDC was a populär idea. No 
one in the community was a likelier object of sympathy than the young 
widow with small children to raise, and AFDCseemed to he a way to help 
her stay home with her children until they were old enough to begin tak- 
ing care of her in their turn. And if some of the women going on AFDC 
had not been widowed hut abandoned by no-good husbands, mosr peo- 
ple thought that they should be helped too, though some people voiced 
concerns that helping such women undermined marriage. 

But hardly anyone had imagined that never-married women would 
be eligible for AFDC. It came as a distressingsurprise to Frances Perkins, 
the first woman cabinet member and a priinary Sponsor of the legisla- 
tion, to find that they were. 1 But not only were they eligible; within a 
few years after AFDC began, they constituted a large and growing por- 
tion of the caseload. This created much of the general public’s antago- 
nism toward AFDC: It wasn’t just the money, it was the principle of the 
thing. Why should hardworking citizens support immoraliry? 

Such complaints about welfare go far back into the 1940s and even 
the 1930s, but, at least from our perspective in the 1990s, it was much 
ado about a comparatively small problem, as the next figure shows. After 


The welfare revolution 
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a slow and meandering rise since the end of World War II, the welfare 
caseload was still less than 2 percent of families when John F. Kennedy 
took otfice. Then, as with so many other social phenomena, the dy- 
namics abruptly changed in mich 1960s. In a concentrated period from 
1966 to 1975, the percentage of American families on welfare nearly 
tripled. The growth in the caseload then stopped and even declined 
slighrly through the 1980s. Welfare rolls have been rising steeply since 
1988, apparently beginning a fourth era. As of 1992, more than 14 mih 
lion Americans were on welfare. 

The steep rise in the welfare population is obviously not to be ex- 
plained hy intelligence, which did not plummet in the 1960s and 1970s. 
More fundamental forces were reshaping the social landscape during 
that time. The surging welfare population is just one outcropping among 
others summarized in Part II of trouhle in American society. In this chap- 
ter, the eherne will be, as it is elsewhere in the book, that as society 
changes, some people are especially vulnerable to the changes—in this 
instance, to events that cause dependence on welfare. We show here 
that low intelligence increases a white mothers risk of going on welfare, 
independent of the other factors rhat mighr be expected ro explain away 
the relationship. 

IQ AND WELFARE 

It has not been an openly discussed topic, hut there are many good rea- 
sons for assuming thar welfare mothers come mainly from the lower 
reaches of the distribution of cognitive ability. Women on welfare have 
less education than women not on welfare, and chronic welfare recipi- 
ents have less education than nonchronic recipients/ Welfare mothers 
have been estimated to have reading skills that average three to five 
years below grade leveld Poor reading skills and little schooling define 
populations with lower-than-average IQ, so even without access to IQ 
tests, it can be deduced that welfare mothers have lower-than-average 
intelligence. Bur can it be shown that low IQ has an independent link 
with welfare itself, after taking account of che less direct links via being 
poor and being an unwed mother? 4 

By a direct link, we mean something like this: The smarter the woman 
is, the more likely she will be able to find a job, the more likely she will 
he able to line up other sources of support (from parents or the father of 
the child), and the more farsighted she is likely ro be about the dangers 
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of going on welfare. Even within the population of women who go on 
welfare, cognitive ability will vary, and the smarter ones will be better 
able to get off. 

No database until the NLSY bas offered the chance to test these hy- 
potheses in detail for a repräsentative population. We hegin as usual 
with a look at the unadorned relationship with cognitive dass. 

Use of welfare is uncommon but not rare among these white motlv 
ers, as the tahle below shows. Overall, 12 percent of the white mothers 
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in the NLSY received welfare within a year of the birth ot their first 
child; 9 percent had become chronic recipients by our definition of 
chronic welfare recipients (meaning that they had reported at least tive 
years of welfare income). Overall, 21 percent of white mothers had re- 
ceived assistance from AFDC at some point in their livesd 1 The differ- 
ences among the cognitive classes are large, with a conspicuously large 
jump in the rates at the bottom. The proportion of women in Class IV 
who became chronic welfare recipients is double the rate for Class 111, 
with another big jump for Class V, to 31 percent of all mothers. 

This result should come as no surprise, given what we already know 
about the higher rates of illegitimate births in the lower half of the cog¬ 
nitive ability distribution (Chapter 8). Women without husbands are 
most at risk for going on welfare. We also know that poverty has a strong 
association with the birth Status of the child. In fact, it may be asked 


whether we are looking at anything except a reflection of illegirimacy 
and poverty in these figures. The answer is yes, but a somewhat differ¬ 
ent “yes” for periodic and for chronic welfare recipiency. 

GOING ON WELFARE AFTER THE BIRTH OF THE FIRST CHILD 

First, we ask of the odds that a woman had received welfare by the end of 
the first calendar year after the birth of her first child.* 61 In all cases, we 
limit the analysis to white women whose first child was born prior to 
1989, so that all have had a suffident “chance” to go on welfare. 

lf we want to understand the independent relationship hetween IQ 
and welfare, the Standard analysis, using just age, IQ, and parental SES, 
is not going to teil us much. We have to get rid of the confounding cf- 
fects of being poor and unwed. For that reason, the analysis that yielded 
the figure below extracted the effects of the mantal Status of the rnother 

Even after poverty and marital Status are taken into aceount, 

IQ played a substantial role in determining whether white 
women go on welfare 


Probability of going on welfare within a year after birth 
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Note: For computing the plot, age and either SES (for the hlack curves) or IQ (for the gray 
curves) wereser at their mean values. Additional independent variables of which the effec ts 
have heen extracted for the plot: marital Status at the time ot first birth, and poverty Status 
in the calendar year prior to the first birth. 
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and whether she was below ehe poverty line in the year betöre hirth, in 
addition to rhe usual three variables. The dependent variable is whether 
rhe morher received welfare henefits during rhe year after the hirth ot 
her tirst child. As the black line indicates, cognitive ability predicts go- 
ing on welfare e.ven after the eftects ot marital Status and poverty have 
been extracted. This finding is worth thinking about, for it is not inru- 
itively predictable. Presumably much of the impact ot low intelligence 
on he ing on welfare—the failure to look ahead, to consider conse- 
quences, or to get an education—is already captured in the taer rhat the 
woman had a baby out of wedlock. Other elements of competence, or 
lack ot it, are captured in the fact that the woman was poor betöre the 
baby was born. Yet Holding the eftects of age, poverty, marital Status, 
and parental SES constanr, a white woman with an IQ at rhe 2d cen- 
tile had a 47 percent chance of going on welfare, compared to the 8 per- 
cent chance facing a white woman at the 98ch centile. 

The socioeconomic background of these mothers was not a Statisti¬ 
cal ly significant factor in their going on welfare. 

The Rnle of Education 

We cannor analyze welfare recipiency among white women with a bach- 
elors degree heeause it was so rare: Of the 102 white mothers with a 
B.A. (no more, no less) who met the criteria for the sample, 101 had 
never received any welfare. But we ean take a look at the high school 
sample. For them, low cognitive ability was as decisive as for the entire 
population of NLSY white mothers. The magnitude of the independent 
effect of IQ was about the same, and the eff ec t of socioeconomic Status 
was again Statistically insignificant. The other variables swept avvay all 
ot the Connections between welfare and social dass that seem so evi¬ 
dent in everyday life. 

CHRONIC WELFARE DEPENDENCY 

Now we focus on a subset of women who go on welfare, rhe chronic 
welfare recipients. They constitute a world of their own. In the course 
of the furious political and scholarly struggle over welfare during the 
1980s, two stähle and consistent findings emerged, each having different 
implications: Taking all rhe women who ever go on welfare, the aver¬ 


age spell lasts only about two years/ Ruf among never-married mothers 
(all raees) who had their hahies in their teens, the average time on weh 
fare is eight or more years, depending on the sample he ing investigated/ 

The white women who had met our detinition of chronic welfare re¬ 
cipient in the NLSY hy the 1990 interview fit this profile to somc ex¬ 
tern. For example, of the white women who gave hirt h to an tl legitimate 
baby before they were 19 (that is, they probably got pregnant hefore they 
would normally have graduated from high school) and stayed single, 22 
percent hecame chronic welfare recipients hy our definition—a high 
perccntage compared ro women at large. On the other hand, 22 percent 
is a long vvay from 100 percent. Even if we restrict rhe criteria further 
so that we are talking about single teenage mothers who were helow the 
poverty line, the prohahility ot becoming a chronic welfare recipient 
goes up only ro 28 percent. 

To ger an idea of how restricted the population ot chronic welfare 
mothers is, consider the 152 white women in the NLSY who met our 
definition of a chronic welfare recipient arid also had IQ scores. None 
of them was in Cognitive (dass 1, and only five were even in Class II. 
Only live had parents in the top quartile in socioeconomic class. One 
lone woman ot the 1 52 was from the top quartile in ability and from the 
top quartile in socioeconomic background. White women with above- 
average cognitive ability or socioeconomic background rarely beeome 
chronic welfare recipients. 

Keep ing this tight restricrion ot ränge in mind, consider what hap¬ 
pe 1 ns when we repeat the previous analysis (including the extra variables 
Controlling for marital Status and poverty at the time of tirst hirth) but 
this time comparing mothers who hecame chronic welfare recipients 
with women who never received any welfareA 1 According to the tigure, 
when it comes to chronic white welfare mothers, the independent ef¬ 
fect of the young woman’s socioeconomic background is substantial. 
Whether it becomes more important than IQ as the figure suggests is 
doubtful (the corresponding analysis in Appendix 4 says no)» butclearly 
the role of socioeconomic background is different for all welfare recip¬ 
ients and chronic ones. We spent much time exploring this shift in rhe 
role of socioeconomic background, to try to pin down what was going 
on. We will not describe our investigation with irs many interest ing hy- 
ways, instead simply reporting where we came out. The answer tums out 
to hinge on education. 



198 Cognittve Classes and Social Behavior 


Welfare Dependency 199 


Socioeconomic background and IQ are both important 
in determining whether white women become 

chronic welfare recipients _ 

Probability of being a chronic welfare recipient 
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The Role of Education 

White chronic welfare recipients are virtually all women with modest 
education at best, as set out in the next table. More than half ot the 
chronic welfare recipients had not gotten a high school diploma; only 
six-tenths of 1 percent had gotten a College education. As in the case 
of IQ and socioeconomic Status, this is a radically unrepresentative sam¬ 
ple of white women. 1101 It is obviously impossihle (as well as unneces- 
sary) to analyze chronic welfare recipiency among College graduates. 

The women for whom socioeconomic background was the main risk 
factor for being chronically on welfare are those who had not finished 
high school. For women with a high school diploma or more, IQ was 


Educational Attainment of White 
Chronic Welfare Recipients 

Highest Degree 
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Advanced degree 

0 

B.A. or B.S. 

1 

Associate degree 

3 

High school diploma 

42 

GED 

16 

Lcss than high school 

38 


more important than socioeconomic Status (other things equal) in af- 
tecting the probability of becoming a chronic welfare recipient. 1111 

Why ? Apparently rhe women who did not finish high school and had 
an illegitimate child were selected for low intelligence, especially if they 
had the child while still in high school. 1121 The average IQ of these 
women was about 91, and analysis teils us that further Variation in cog- 
nitive ahility does not have much power to predict which ones hecome 
chronic welfare cases. |M1 Instead, for this narrowly screened group of 
women, family background matters more. Without trying to push the 
analysis much further, a plausible explanation is that for most white 
American parents, having a school-aged child go on welfare is highly 
srigmatizing to them. If the daughter of a working-class or middle-class 
couple has an illegitimate baby out of wedlock while still in high school, 
chances are that her parents will take over Support for the new baby 
rather than let their daughter go on welfare. The parents who do not 
keep their school-aged daughter off welfare will tend to he those who 
are not deterred by the Stigma or who are themselves too poor to Sup¬ 
port the new baby. Both sets of parents earn low scores on the socio- 
economic Status index. Hence what we are observing in the case of 
chronic welfare recipiency among young women who do not finish high 
school may reflect parental behavior as much as the young mother’s he- 
havior. 1M1 

Other hypotheses are possihle, however. Generally these results pro- 
vide evidence ft^r those who argue that a culture of poverty transmits 
chronic welfare dependency from one generation to the next. Our 
analysis adds that women who are susceptible to this culture are likely 
to have low intelligence in the first place. 
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DRAWING TOGETHER THE EINDINGS ON ILLEGITIMACY 
AND WELFARE 

As social scientists often do, we have spent much effort burrowing 
through analyses rhat ultimately point to simple conclusions. Here is 
how a grear many parents around America have put ir to their daugh- 
ters: Having a baby without a husband is a dumb thing to do. Going on 
welfare is an even dumber thing to do, if you can possihly avoid it. And 
so it would seem to be among the white women in the NLSY. White 
women who remained childless or had habies within marriage had a 
mean IQ of 105. Those who had an illegitimate baby but never wem on 
welfare had a mean IQ of 98. Those who went on welfare but did not 
become chronic recipients had a mean IQ of 94. Those who became 
chronic welfare recipients had a mean IQ of 92. llS1 Altogether, almost a 
Standard deviation separated the IQs of white women who hecame 
chronic welfare recipients trom those who remained childless or had 
children within marriage. 

In Chapter 8, we demonstrated that a low IQ is a factor in illegiti¬ 
mate births that cannot be explained away hy the woman’s socio- 
economic background, a broken family, or poverty at the time the child 
was conceived. In particular, poor women of low intelligence seemed 
especially likely to have illegitimate babies, which is consistent with the 
idea that the prospect of welfare looms largest for women who are think- 
ing least clearly ahout their futures. In this chapter, we have demon¬ 
strated that even among women who are poor and even among those 
who have a baby without a husband, the less intelligent tend ro be the 
ones who use the welfare System. 

Two qualifications to this conclusion are that (1) we have no vvay 
of knowing whether higher education or higher IQ explains why 
College graduates do not use welfare—all we know is that welfare is 
almost unknown among college-educated whites, but that for women 
with a high school education, intelligence plays a large independent 
role—and (2) for the low-IQ women without a high school education 
who become chronic welfare recipients, a low socioeconomic Back¬ 
ground is a more important predictor than any further influence of 
cognitive ability. 

The remaining issue, which we defer to the discussion of welfare pol- 
icy in Chapter 22, is how to reconcile two conflicting possibilities, borh 


of which may have some truth to them: Going on welfare real ly is a 
dumb idea, and that is why women who are low in cognitive ability end 
up there; but also such women have litt.le to take to the job market, and 
welfare is one of their few appropriate recourses when they have a baby 
to care for and no husband to help. 






Chapter 10 

Parenting 


Evertont' agrees, in the abstract and at the extremes, that there is good par¬ 
enting and paar parenting. This chapter addresses the uncomfortable ques- 
tion: Is the competence of parents at all affected by how intelligent they are? 

It has been known for some time that socioeamomic dass and parenting 
are linked, both tu disciplmary practices and to the many ways in which the 
intellectual and Jt/orui/ development of the child are fostered. On both 
counts, parents with higher socioeconomic Status look beiter. At the other end 
of the parenting cuntinuum, neglect and abuse are heavily concentrated in the 
lower socioeconomic classes. 

Whenever an IQ measure has been introduced in to studies of parent-child 
relationships, it has explained away much of the differences that otherwise 
would have been attributed to education or social dass, but the examples are 
sparse. The NLSY provides an opportunity to fill in a few of the gaps. 

With regard to prenatal and infam care y low IQ among the white mothers 
in the NLSY sample was related to low birth weight, even öfter controllmg for 
socioeconomic background, poverty, and age of the mother. In the NLSY's 
surveys of the home environment, mothers in the top cognitive classes pro- 
vided, on average, beeter environments for children than the mothers in the 
bottom cognitive classes. Socioeconomic background and current poverty also 
played significant roles, depending on the specific type of measure and the age 
of the children. 

In the NLSY's measures of child development, low matemal IQ was asso- 
ciated with problematic temperament in the bahy and with low scores on an 
iudex of li friendlinesswith paar motor and social development of toddlers 
and with behavioral Problems from age 4 on up. Poverty usually had a müd¬ 
est independent wie but did not usually diminish the contribution of IQ (which 
was usually also müdest). Predictably, the mothers IQ was also strongly re¬ 
lated to the IQ of the child. 

Taking these data together, the NLSY results say clearly that high IQ is by 
no means a prerequisite /or being a good mother. The disquieting finding is 
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that the worst environments for raising children, of the Und that not even the 
most resilient children can easily overcome, are concentrated in the homes in 
which the mothers are at the low end of the intelligence distribution . 

P arenting, in one sense the most private of behaviors, is in another 
the most public. Parents make a difference in the way their children 
turn out—whether they become law abiding or criminal, generous or 
stingy, productive or dependent. How well parents raise their children 
has much to do with how well the society functions. 

But how are parents to know whether they are doing a good or a bad 
job as parents? The results seem to be hopelessly unpredictable. Most 
people know at least one couple who seem to be the ideal parents but 
whose teenage child ends up on drugs. Parents with more than one child 
are bemused by how differently their children respond to the same home 
and parental style. And what makes a good parent anyway? Most peo¬ 
ple also have friends who seem to be raising their children all wrong, 
and yet the children flourish. 

The exceptions notwithstanding, the apparent unpredictability of 
parenting is another of those illusions fostered by the ground-level view 
of life as we live it from day to day. Parenting is more predictable in the 
aggregate than in the particular. The differences in parenting style that 
you observe among your friends are usually minor—the “restriction of 
ränge” problem that we discussed in Chapter 3. A middle-class mother 
may think that one of her friends is far too permissive or strict, but put 
against the full ränge of Variation that police and social workers are 
forced to deal with, where “permissiveness” is converted into the num- 
ber of days that small children are left on their own and “strictness” may 
be calibrated by the number of stitches required to close the wounds 
from a parental beating, the differences between her and her friend are 
probably small 

Despite all the differences among children and parents, there is such 
a thing as good parenting as opposed to bad—not precisely defined but 
generally understood. Our discussion proceeds from the assumption that 
good parenting includes (though is not restricted to) seeing to nourish- 
ment and health, keeping safe from harm, feeling and expressing love, 
talking with and listening to, helping to explore the world, imparting 
values, and providing a framework of rules enforced consistently but not 
inflexibly. Parents who more or less manage to do all those things, we 


assert, are better parents than people who do not. The touchy question 
of this chapter is: Does cognitive ability play any role in this? Are peo¬ 
ple with high IQs generally better parents than people with low IQs? 

SOCIAL CLASS AND PARENTING STYLES 

The relationship of IQ to parenting is another of those issues that so¬ 
cial scientists have been slow to investigate. Furthermore, this is a topic 
for which the NLSY is limited. For unemployment, school dropout, il- 
legitimacy, or welfare recipiency, the NLSY permits us to cut directly to 
the question, What does cognitive ability have to do with this behav¬ 
ior? But many of the NLSY indicators about parenting give only indi- 
rect evidence. To interpret that evidence, it is useful to begin with the 
large body of studies that have investigated whether social dass affects 
parenting. Having described that relationship (which by now is rea- 
sonably well understood), we will be on firmer ground in drawing in- 
ferences about cognitive ability. 

The first scholarly study of parenting styles among parents of differ¬ 
ent social classes dates back to 1936 and a White House Conference on 
children. 1 Ever since, the anthropologists and sociologists have told sim- 
ilar stories. Working-class parents tend to be more authoritarian than 
middle-class parents. Working-class parents tend to use physical pun- 
ishment and direct commands, whereas middle-class parents tend to use 
reasoning and appeals to more abstract principles of behavior. The con- 
sistency of these findings extends from the earliest studies to the most 
recent. [2] 

In an influential article published in 1959, Melvin Kohn proposed 
that the underlying difference was that working-class parents were most 
concerned about qualities in their children that ensure respectability, 
whereas middle-class parents were most concerned about internalized 
Standards of conduct. 3 Kohn argued that the real difference in the use 
of physical punishment was not that working-class parents punish more 
but that they punish differently from middle-class parents. Immediate 
irritants like boisterous play might evoke a whack from working-class 
parents, whereas middle-class parents tended to punish when the intent 
of the child’s behavior (knowingly hurting another child, for example) 
was problematic. 4 Kohn concluded that a the working-class orientation 
. . . places few restraints on the impulse to punish the child when his be¬ 
havior is out of bounds. Instead, it provides a positive rationale for pun- 
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ishing the child in precisely those circumstances when one might most 
like to do so.” 5 To put it more plainly, Kohn found that working-class 
parents were more likely to use physical punishment impulsively, when 
the parents themselves needed the relief, not when it was likely to do 
the child the most good. 

The middle-class way sounds like “better” behavior on the part of pan 
ents, not just a neutral socioeconomic difference in parenting style, and 
this raises a point that scholars on child development bend over back' 
ward to avoid saying explicitly: Generally, and keeping in mind the 
many exceptions, the conclusion to be drawn from the literature on pan 
enting is that middle-class people are in fact better parents, on average, 
than working-class people. Readers who bridle at this Suggestion are uv 
vited to reread the Kohn quotation above and ask themselves whether 
they can avoid making a value judgment about it. 

Parenting differences among the social classes are not restricted to 
matters of discipline. Other major differences show up in the intellec- 
tual development of the child. Anthropologist Shirley Brice Heath 6 
gives vivid examples in her description of parenting in “Roadville,” a 
white lowenclass community in the Carolinas, versus “Gateway,” a 
nearby community of white middle-class parents. [7] The parents of Road- 
ville were just as devoted to their children as the parents of Gateway. 
Roadville newborns came home to nurseries complete with the same 
mobiles, pictures, and books that the Gateway babies had. From an early 
age, Roadville children were held on laps and read to, talked to, and 
otherwise made as much the center of attention as Gateway babies. But 
the interactions differed, Heath found. Take bedtime stories, for exauv 
ple. In middle-class Gateway, the mother or father encouraged the chil- 
dren to ask questions and talk about what the stories meant, pointing 
at items on the page and asking what they were. The middle-class pan 
ents praised right answers and explained what was wrong with wrong 
ones. 8 It is no great Stretch to argue, as Robert Sternberg and others do, 
that this interaction amounts to excellent training for intelligence tests. 
Lowenclass Roadville parents did not do nearly as much of that kind of 
explaining and asking. 9 When the children were learning to do new 
tasks, the Roadville parents did not explain the “how” of things the way 
the Gateway parents did. Instead, the Roadville parents were more 
likely to issue directives (“Don’t twist the cookie cutter”) and hardly 
ever gave reasons for their instructions (“If you twist the cutter, the 
cookies will be rough on the edge”). 10 


When they got to school, the Roadville and Gateway children com 
tinued to differ. The working-class Roadville children performed well 
in the early tasks of each of the first three grades. They knew the ah 
phabet when they went to kindergarten; they knew how to sit still in 
dass and could perform well in the reading exercises that asked them 
to identify specific portions of words or to link two items on the same 
page of the book. But if the teacher asked, “What did you like about the 
story?” or “What would you have done if you had been the child in that 
story?” the Roadville children were likely to say “I don’t know” or shrug 
their shoulders, while the middle-class Gateway children would more 
often respond easily and imaginatively. 11 

Heath’s conclusions drawn from her anthropological observations are 
buttressed by the quantitative work that has been done to date. A review 
of the technical literature in the mkb 1980s put it bluntly: “It is an empir- 
ical fact that children from relatively higher SES families receive an inteb 
lectually more advantageous home environment. This finding holds for 
white, black, and Hispanic children, for children within lower- and 
middle-SES families, as well as for children bom preterm and fulhterm.” 12 

SOCIAL CLASS AND MALPARENTING 

To this point, we have been talking about parenting within the normal 
ränge. Now we tum to child neglect and child abuse, increasingly la- 
beled “malparenting” in the technical literature. 

Abuse and neglect are distinct. The physical battering and other 
forms of extreme physical and emotional punishment that constitute 
child abuse get most of the Publicity, but child neglect is far more com¬ 
mon, by ratios ranging from three to one to ten to one, depending on 
the study. 13 Among the distinctions that the experts draw between child 
abuse and neglect are these: 

• Abuse is an act of Commission, while neglect is more commonly 
an act of omission. 

• Abuse is typically episodic and of short duration; neglect is chronic 
and continual. 

• Abuse typically arises from impulsive outbursts of aggression and 
anger; neglect arises from indifference, inattentiveness, or being 
overwhelmed by parenthood. [14] 
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Commonly, neglect is as simple as failure to provide a child with ad- 
equate food, clothing, shelter, or hygiene. But it can also mean leaving 
dangerous materials within reach, not keeping the child away from an 
open window, or leaving toddlers alone for hours at a time, lt means not 
taking the child to a doctor when he is sick or not giving him the med- 
icine the doctor prescribed. Neglect can also mean more subtle depri- 
vations: habitually leaving babies in cribs for long periods, never talking 
to infants and toddlers except to scold or demand, no smiles, no bed- 
time stories. At its most serious, neglect becomes abandonment. 

Are abusing parents also neglectful? Are neglectful parents also abu- 
sive? Different studies have produced different answers. Child abuse in 
some bizarre forms has nothing to do with anything except a profoundly 
deranged parent. Such cases crop up unpredictably, independent ot de- 
mographic and socioeconomic variables. 1 ’ 

Once we move away from these exceptional cases, however, abuse 
and neglect seem to be more alike than different in their originsC The 
theories explaining them are complex, involving stress, social isolation, 
Personality characteristics, Community characteristics, and transmis- 
sion of malparenting from one generation to the next. 1 ' But one coiv 
comitant of malparenting is not in much dispute: Malparenting of either 
sort is heavily concentrated in the lower socioeconomic classes. lndeed, 
the link is such that, as Douglas Besharov has pointed out, behaviors 
that are sometimes elassified as forms of neglect—letting a child skip 
school, for example—are not considered neglectful in some poor conv 
munities but part of the normal pattern of upbringing. 18 What would be 
considered just an overenthusiastic spanking in one neighborhood 
might be called abuse in another. 

We realize that once again we are contradicting what everyone 
knows, which is that “child abuse and neglect afflict all communities, 
regardless of race, religion, or economic Status,” to pick one formulation 
of this common belief. N And in a narrow technical sense, such State¬ 
ments are correct, insofar as neglect and abuse are found at every social 
and economic level, as is every other human behavior. lt is also correct 
that only a small minority of parents among the poor and disadvantaged 
neglect or abuse their children. But the way such Statements are usually 
treated in the media, by politicians, and by child advocacy groups is to 
imply that child neglect and abuse are spread evenly across social classes, 
as if children have about an equal chance of being abused or neglected 
whether they come from a rieh home or a poor one, whether the mother 


is a College graduate or a high school dropout. And yet from the earli- 
est studies to the present, malparenting has been strongly associated 
with socioeconomic dass. 

The people who argue otherwise do not off er data to make their case. 
Instead, they argue that child neglect and abuse are reported when it 
happens to poor children but not rieh ones. Affluent families are be- 
lieved to escape the reporting net (by using private physicians, for ex¬ 
ample, who are less likely to report abuse). Social Service agencies are 
said to be reluctant to intervene in affluent families. 20 Poor people are 
likely to be labeled deviant for behaviors that would go unnoted or un- 
remarked in richer neighborhoods/ 1 People are likely to think the worst 
of socially unattractive people and give socially attractive people the 
benefit of the douht. 22 

Studies spread over the last twenty years have analyzed reporting bias 
in a variety of ways, including surveys to identify abuse that goes unre- 
ported through official channels. The results are consistent: The so¬ 
cioeconomic link with maltreatment is authentied Probably the link is 
stronger for neglect than for abuse. 24 But specifying exactly how strongly 
socioeconomic Status and child maltreatment are linked is difficult be- 
cause ot the genuine shortcomings of official reports and because so 
many different kinds of abuse and neglect are involved. The following 
numbers give a sense of the Situation: 

• ln an early national study (using data for 1967) 60 percent of the 
families involved in abuse incidents had been on welfare during 
or prior to the study year. 25 

• ln data on 20,000 validated reports of child abuse and neglect col- 
lected by the American Humane Association for 1976, half of the 
reported families were below the poverty line and most of the rest 
were concentrated just above it. 26 

• In a 1984 study of child maltreatment in El Paso, Texas, 87 
percent of the alleged perpetrators were in families with in- 
comes under $18,000, roughly the bottom third of income. 
Seventy-three percent of the alleged female perpetrators were 
unmarried. 27 

• In the federally sponsored National Incidence Study in 1979, 
which obtained information on unreported as well as reported 
cases, the families of 43 percent of the victims of child abuse or 
neglect had an income under $7,000, compared to 17 percent of 
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Other Precursors of Maltreatment 


Prernature births, low birth weight, and illegitimacy also have links wich 
maltreatment. Studies in America and Britain have found rates of low hirth 
weight among abused children running at rhree to four times che national 
average.' 9 Prematurity has been found to be similarly disproportionate 
among abused children. 10 The proportion of neglected children who are ib 
legitimate has run far above national averages in studies from the early 
1960s onward. More than a quarter of the neglected children in the mid- 
1960s were illegitimate, for example—almosr four times the national Pro¬ 
portion. M In a British sample, 36 percent of the neglected children were 
illegitimate compared to 6 percent in the control groupA 


all American children. Only 6 percent of the abusive or neglecr- 
tul families had incomes of $25,000 or more. 

• The 1986 replication of the National Incidence Study found thar 
the rate of abuse and neglect among families with incomes under 
$15,000 was five times that of families with incomes above 
$15,000. Only 6 percent of the families involved in neglect or 
abuse had incomes above the median tor all American familiesA 

Given the one-sided nature of the evidence, why has the “myth of 
classlessness,” in Leroy Peltons phrase, been so tenacious? Pelton him- 
seif blamed social Service Professionals and politicians, arguing that hoth 
of these powerful groups have a vested interest in a medical model of 
child abuse, in which child abuse falls on its victims at random, like the 
fluA Pelton does not mention another reason that seems plausible to 
us: Child abuse and neglect are held in intense distaste by most Amer- 
icans, who feel great hostility toward parents who harm their children. 
People who write about malparenting do not want to encourage this 
hostility to spill over into hostility toward the poor and disadvantaged. 

Whatever the reasons, the myth of classlessness is alive and well. 1t 
is a safe bet that at the next Senate hearing on a child neglect bill, wit- 
nesses and Senators alike will agree that neglect and abuse are scattered 
throughour society, and the next feature story on child neglect you see 
on the evening news will report, as scientific fact, that child neglect is 
not a special prohlem of the poor. 


PARENTAL IQ AND PARENTING 

In all of these studies of socioeconomic Status and parenting, the obvi- 
ous but usually ignored possibility has been that the parents’ cognitive 
ability, not their Status, was an important source of the differences in 
parenting styles and also an important source of the relationship be- 
tween malparenting and childrens IQs. Indeed, even without conduct- 
ing any additional studies, some sort of role for cognitive ability must 
be presupposed. If cognitive ability is a cause of socioeconomic Status 
(yes) and if socioeconomic Status is related to parenting style (yes), then 
cognitive ability must have at least some indirect role in parenting style. 
The same causal chain applies to child maltreatment. 

Direct evidence for a link with IQ is sparse. Even the educational at- 
tainment of the abusing parents is often unreported. But a search of the 
literature through the early 1990s uncovered a number of fragments that: 
point to a potentially important role for cognitive ability, if we bear in 
mind that cognitive ability is a stronger predictor of school dropout than 
is socioeconomic Status (Chapter 6): 

• In Gil’s national study of child abuse reports, more than 65 per¬ 
cent of the mothers and 56 percent of the fathers had not com- 
pleted high school. 1 H1 

• A study of 480 infants of women registering for prenatal care at an 
urban hospital for indigent persons and their children found that 
the less educated mothers even within this disadvantaged popula- 
tion were more like ly to neglect their babiesA 

• Three studies of child maltreatment in a central Virginia city of 
80,000 people found that neglecting families had an average 
eighth-grade education, and almost three-quarters of them had 
been placed in classes for the mentally retarded during their school 
years. In contrast with the neglecting families, the abusing fami¬ 
lies tended to be literate, high school graduates, and of normal in- 
telligenceA 

• A study of fifty-eight preschool children of unspecified race in the 
Cleveland area with histories of failure to thrive found that their 
mothers' IQs average was 81A No comparison group was available 
in this study, but a mean of 81 indicates cognitive functioning at 
approximately the lOth centile. 
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• A study of twenty ahusive or neglectful mothers and ten compar- 
ison mothers from inner-city Rochester, New York, found rhat 
maltreating and nonmaltreating mothers differed signifieantly in 
their judgment about child behavior and in their prohiem-solving 
ahilities.^ 

• A clinical psychological study of ten parents who hattered their 
children severely (six of the children died) classified five as hav- 
ing a “high-grade mental defieiency” (menrally retarded), one as 
eiul 1, and another as he low average. The remaining three were das- 
sified as ahove average. ,g 

• A quantitative study of 113 two-parent families in the Nether- 
iands tound that parents with a high level of “reasoning cotnplex- 
ity” (a measure of cognitive ahility) responded to their children 
more flexihly and sensitively, while those with low levels of rea¬ 
soning complexiry were more authoritarian and rigid, indepen¬ 
dent of occupation and education. 40 

The most extensive clinical studies of neglectful mothers have heen 
conducted hy Norman Polansky, whose many years ot research hegan 
with a sample drawn from rural Appalachia, suhsequently replicared 
with an urban Philadelphia sample. He described the typical neglectful 
mother as follows: 

She is of limited intelligence (IQ below 70), has failed to achieve 
more than an eighth-grade education, and has never held . . . employ- 
ment. . . . She has at best a vague, or extremely limited, idea of what 
her children need emotionally and physically. She seldom is ahle to 
see things from the poincofview ofothers and cannot take their needs 
into consideration when responding to a conflict they experience. 41 

The specifi c IQ figure Polansky mentions corresponds to the upper edge 
of retardation, and his description of her personal ity invokes further 
links hetween neglect and intelligence. 

Another body of literature links neglectful and ahusive parents to 
Personality characteristics that have clear links to low cognitive ahih 
ity. (4 ^ J The most extensive evidence descrihes the impulsiveness, incon- 
sistency, and confusion that mark the parenting style of many ahusive 
parents. 41 Ihe ahusive parents may or may not punish their children 
more often or severely in the ordinary course of events than other pan 
enrs (studies differ on this point), 44 but the abuse characteristically 


comes unpredictably, in episodic hursts. Ahusive parents may punish a 
given behavior on one occasion, ignore ir on another, and encourage it 
on a rhird. The inconsisrency can reach mysrifying proportions; one 
study of parent-child interactions found that children in ahusing fami¬ 
lies had about the same chance of ohtaining positive reinforcement for 
aggressive behaviors as for pro-social hehaviors. 4 ^ 

The ohserved inconsistency of abusing parents was quantified in one 
ot the early and classic studies of child abuse hy Leontine Young, Wednes- 
tlay's Children. By her calculations, inconsisrency was the rule in all of 
the “severe abuse” families in her sample, in 91 percent of the “moder¬ 
ate abuse” families, 97 percent of the “severe neglect” families, and 88 
percent of the “moderate neglect” families. 40 ln one of the most exten¬ 
sive literature reviews of the behaviorul and personality dimensions of 
ahusive parents (as of 1983), the author concluded that the main prob- 
lem was not that ahusive parents were artached to punishment as such 
hut that they were simply incompetent as parents. 4 ' 

One might think that researchers seeing these malparenting patterns 
would natural ly he inspired to look at the parents 1 intelligence as a pre- 
dictor. And yet in that same literature review, examining every rigorous 
American study on the subjecr, the Word intdfigence (or any synonym 
for it) does not occur until the next-to-last page of the arricle. 1401 The 
word finally makes irs appearance as rhe literature review nears its end 
and the author tums to hi.s recommendations for future research. He 
not es that in an ongoing British prospective study of parenting, “moth¬ 
ers in their Excellent Care group, for example, were found toheof higher 
intelligence . . . than parents in their Inadequate Care group,” and then 
descrihes several ways in which the study found that maternal intellh 
gence seemed to compensate for other deprivations in the child ’s life. ! ‘ w| 
With such ohvious signals about such t ragic problems as child neglect 
and abuse, perhaps an editorial comment is appropriate: The reluciance 
of scholars and policymakers alike to look at the role of low intelligence 
in malparenting may properly be called scandalous. 

MATERNAL IQ AND THE WELL-BEING OF INFANTS 

Comhined with the literature, the NLSY lends further insight into good 
and had parenting. We begin with information on the ways in which 
women of varying cognitive ability care for their children and then turn 
to the outcomes for the children themselves. 
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Preruital Care 

In most of the ways that are easily measurable, most white women in 
the different cognitive classes behaved similariy during pregnancy. Al¬ 
most everyone got prenatal care, and similar proport ions in all cogni- 
tive classes began getting it in the early months. lf we take the NLSY 
mothers’ self-descriptions at face value, alcohol consumption during 
pregnancy was about the same across the cognitive classesA 01 The risk 
of miscarriage or a stillbirth was also spread more or less equally across 
cognitive classes. 

Smoking was the one big and medically important difference related 
to matemal intelligence: The smarter the women, the less they smoked 
while they were pregnant. Fifty-one percent of the women in the bot- 
tom two cognitive classes smoked, and 19 percent of them admitted to 
smoking more than a pack a day. In the top two cognitive classes, only 
16 percent of the white women in the NLSY smoked at all, and only 4 
percent admitted to smoking more than a pack a day. ln Class I, no one 
smoked. Smarter pregnant women smoked less even after Controlling 
for their socioeconomic backgrounds. Higher levels of education, inde¬ 
pendent of intelligence, also deterred pregnant women from smoking. 1[M| 

Low Birth Weight 

We locus here on an indicator that is known to have important impli- 
cations for the subsequent health, cognitive ability, and emotional de¬ 
velopment of the child and is also affected to some degree by how well 
women have cared for themselves during pregnancy: low birth weight A 
Low birth weight is often caused by behaviors during pregnancy, such 
as smoking, drug or alcohol abuse, or living exclusively on junk food, 
that are seldom caused by pure ignorance these days. The pregnant 
woman who never registers the simple and ubiquitous lessons about tak- 
ing care of herseif and her baby, fails to remember them, or fails to act 
on them could be willfully irresponsible or in the grip of an irresist ible 
addiction to drugs or junk food, but slow comprehension, a short time 
horizon, and difficulty in connecting cause and effect are at least as plau¬ 
sible an explanation, and all of these betoken low IQ. 

A low-birth-weight baby is defined in these analyses as an infant 
weighing less than 5.5 pounds at birth, excluding premature babies 
whose weight was appropriate for their gestational age. 15 * 1 The experi- 
ence of the NLSY mothers is shown in the table below. There does not 


Low Birth Weight Among White Babies 

Cognitive Class 

Incidence per 
1,000 Births 

I Very bright 

50 

II Bright 

16 

III Normal 

32 

iv null 

72 

V Very dull 

57 

Population average 

62 


appear to be much of a re lat ionship between intelligence and low birth 
weight; note the high rate for babies of mothers in Class I (which is dis- 
cussed in the accompanying box). But the table ohscures a strong over- 
all relationship between IQ and low birth weight that emerges in the 
regression analysis shown in the following figure. 

A white mother’s IQ has a significant role in 
determining whether her baby is underweight 
while her socioeconomic background does not 


Probability of being a low-birth-weight baby 

8 %- 

As the mother 's IQ 



Very Jow Very high 

(-2 SDs) (+2SDs) 

Note: For computing the plot, age and either SES (tor the bhick curve) or IQ (for the gray 
curve) were set at their mean values. 
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A low IQ is a major risk factor, whereas the mothers soeioeconomic 
background is irrelevant. A mother at the 2d centile of IQ had a 7 per- 
cent chance of giving birth to a low-birth-weight baby, while a mother 
at the 98th percentile had less rhan a 2 percent chance. 

Adding Poverty. Poverty is an obvious potential factor when trying to 
explain low birth weight. Overall, poor white mothers (poor in the year 
betöre birth) had 61 low-birth-weight babies per 1,000, while other 
white mothers had 36. But poverty’s independent role was small and sta- 
tistically insignificant, once the otherstandard variables vvere taken into 
account. Meanwhile, the independent role ot IQ remained as large, and 
that ot soeioeconomic Background as small, even after the effects of 
poverty were extracted. 


Can Mothers Be Too Smart for Their Own Good? 


The case of low birrh weight is the first example of others you will see in 
which the children of white women in Class 1 have anomalously had scores. 
The obvious, but perhaps too obvious, culprit is sample size. The percenr- 
age of low-birth-weight babies for Class 1 mothers, calculared using sam¬ 
ple weights, was produced hy just rwo low-birth-weight babies out of 
seventy-four births. ,vtl The sample sizes for white ("lass l mothers in rhe 
other analyses that produce anomalous results are also small, sortier imes 
under fifty and always ander one hundred, while the sample sizes for rhe 
middle cognirive classes number several hundred or sometimes thousands. 

On the other hand, perhaps the children of mothers at the very top of 
rhe cognitive distribution do in fact have different tendencies rhan rhe resr 
of rhe ränge. The possibility is sufficiently intriguing that wo report rhe 
anomalous data despite the small sample sizes, and hope that others will 
expiore where we cannot. ln the logistic regression analyses, where euch 
case is treated as an individual unit (not grouped into cognitive classes), 
rhese prohlems ot sample size do not arise. 


Adding mothers age at the time of birth . It is often thought that very 
young mothers are vulnerable to having low-birth-weight babies, no 
matter how good the prenatal care may beQ This was not true in the 
NLSY data for white women, however, where the mothers of low-birth 
weight babies and other mothers had the same mean (24-2 years). 


In sum, neither the mother’s age in the NLSY cohort, nor age at hirrh 
of the child, nor poverty Status, nor soeioeconomic background had any 
appreciable relationship to her chances of giving birth to a low-birth- 
weight baby after her cognitive ahility had been taken into account. 

Adding education. Among high school graduates (no more, no less) 
in the NLSY, a plot of the results of the Standard analysis looks visually 
identical to the one presented for the entire sample, but the sample of 
low-birth-weight babies was so small that the results do not reach Sta¬ 
tistical significance. Among the College graduates, low-birth-weight ha- 
bi es were so rare (only six out of 277 births to the white College sample) 
that a multivariate analysis produced no interpretable results. We do 
not know whether it is the education itself, or the self-selection that 
goes into having morc education, that is responsible for their low inci- 
de nee of under weight babies. 

h\jaiu M ortality 

Though we have not been able to find any studies of cognitive ahility 
and infant mortality, it is not hard to think of a rationale linking them. 
Many things can go wrong with a baby, and parents have to exercise 
both watchfulness and judgment. It takes more than love to childproof 
a house effectively; it also takes knowledge and foresight. It takes intel- 
ligenee to decide that an apparently ordinary bout of diarrhea has gone 
on long enough to make dehyclration a danger; and so on. Nor is sim¬ 
ple knowledge enough. As pediatricians can attest, it may not he enough 
to teil new parents that infants often spike a high fever, that such 
episodes do not necessarily require a trip to the hospital, but that they 
require careful attention lest such a routine fever become life threaten- 
ing. Good parental judgment remains vital. For that matter, the Prob¬ 
lem facing pediatricians dealing with children of less competent parents 
is even more Basic than getting them to apply good judgment: lt is to 
get such mothers to administer the medication that the doctor has pro- 
vided. 

This rationale is consistent with the link that has been found be- 
tween education and infant mortality. In a study of all births registered 
in California in 1978, for example, infant deaths per 1,000 to white 
women numbered 12.2 for women with less than twelve years of edu- 
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cation, 8.3 for those wirh twelve years, and 6.3 for women with rhirteen 
or more years ofeducation, and the role of education remained signifi- 
cant after Controlling for birth order, age of the mother, and marital 
Status. 1561 

We have been unable to identify any study that uses tested IQ as an 
explanatory factor, and, with such a rare event as infant mortahty, even 
the NLSY cannot answer our questions satisfactorily. The rcsults cer- 
tainly suggest that the questions are worth taking seriously. As ot the 
1990 survey, the NLSY recorded forty-two deaths among children born 
to white women with known IQ. Some of these deaths were presumably 
caused by severe medical problems at birth and occurred in a hospital 
where the mother’s behavior was irrelevant. 1 ^ 71 For infants who died be- 
tween the second and twelfth month (the closest we can come to defiiv 
ing “after the baby had left the hospital”), the mothers of the surviving 
children tested six points higher in IQ than the mothers of the deceased 
babies. (The difference for mothers of children who died in the first 
month was not quite three points and for the mothers of children who 
were older than 1 year old when they died, virtually zero.) The samples 
here are too small to analyze in conjunction with socioeconomic Status 
and other variables. 

POVERTY THROUGHOUT EARLY CHILDHOOD 

ln Chapter 5, we described how the high-visibility policy issue of chil¬ 
dren in poverty can he better understood when the mothers IQ is 
brought into the picture. Here, we focus more specifically on the poverty 
in the early years of a child’s life, when it appears to be an especially inv 
portant facror (independent of other variables) in affecting the child’s 
development.^ The variable is much more stringent than simply expe- 
riencing poverty at some point in childhood. Rather, we ask ahout the 
mothers of children who lived under the poverty line throughour their 
first three years of life, comparing them with mothers who were not in 
poverty at any time during the child’s first three years. The Standard 
analysis is shown in the figure below. There are few other analyses in 
Part II that show such a steep effect for hoth intelligente and SES. If 
the mother hasevenan average intelligente and average socioeconomic 
background, the odds of a white child’s living in poverty for his or her 
first three years were under 5 percent. If either of those conditions feil 
below average, the odds increased steeply. 


A white mother’s IQ and socioeconomic background each has 
a large independent effect on her child’s chances of spending 
the first three years of life in poverty 


Probability that a child will live in poverty 
throughout the first three years of life 
40% - 



Note; For computing the plot, and eit her SES (lor the black curve) or 1 Q (tor rhe ^ r ray 
curve) were ser ar rheir mean values, 


The Role of Preexisting Poverty 

When we ask whether the mother was in poverty in the year prior to 
birth, it turns out that a substantial amount of the effect we attrihute to 
socioeconomic background in the figure really reflects whether the 
mother was already in poverty when the child was born. If you want to 
know whether a child will spend his first three years in poverty, the sin¬ 
gle most useful piece of Information is whether the mother was already 
living under rhe poverty line when he was born. Nevertheless, adding 
poverty to the equation does not diminish a large independent role for 
cognitive ability. A child born to a white mother who was living under 
the poverty line hut was of average intelligence had almost a 49 percent 
chance of living his first three years in poverty. This is an extraordinarily 
high chance of living in poverty for American whites as a whole. But if 
the same wo man were at the 2d centile of intelligence, the odds rose to 
89 percent; if she were at the 98th centile, they dropped to 10 percent. 1 ’ 91 
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The changes in the odds were proportionately large for women who were 
not living in poverty when the child was born. 

The Role oj Education 

For children of women with a high school diploma (no more, no less), 
the relationships of IQ and socioeconomie background to the odds that 
a child would live in poverty are the same as shown in the figure above— 
almost equally important, with socioeconomie background fracrionally 
more so—except that the odds are a little lower than for the whole sam¬ 
ple (the highest percentages, for mothers two Standard deviarions he- 
low the mean, are in the high 20s, instead of the mid-30s). As this 
implies, the highest incidence of childhood poverty occurs among 
women who dropped out of school. Among the white College sample (a 
hachelor’s degree, no more and no less), there was nothing to analyze; 
only one child of such mothers had lived his first three years in poverty. 

IQ AND THE HOME ENVIRONMENT FOR CHILD 
DEVELOPMENT 

In 1986, 1988, and 1990, the NLSY conducted special supplementary 
SLirveys of the children and mothers in the sample. The children were 
given tests of mental, emotional, and physical development, to which 
we slvall turn present ly. The mothers were questioned about their chil- 
dren’s development and their rearing practices. The home Situation was 
directly observed. The survey instruments were based on the so-ealled 
HOME (Home Observation tor Measurement of the Environment) 
index. 1601 

Denzens of questions and observations go into creating the summary 
measures, many of them interesting in themselves. Children of the 
brightest mothers ( who also tend to be the best educated and the most 
affluenr) have a big advantage in many ways, especially on such he- 
haviors as reading to the child. On other indicators that are less criti- 
cal in themselves, but indirectly suggest how the child is being raised, 
children with smarter mothers also do better. For example, mothers in 
rhe top cognitive classes use physical punishmenr less offen (though 
they agree in principle that physical punishment can be an appropriate 
response), and the television set is off more of the time in the homes of 
the top cognitive classes. 

Treating the HOME index as a continuous scale running from “very 


had" to “very good” home environments, the advantages of white chil- 
dren with smarter mothers were clear. The average child ol a Class V 
woman lived in a home at the 32d percentile of home environment, 
while the home of the average child of a Class 1 woman was at the 76th 
percentile. The gradations for the three intervening classes were regu¬ 
lär as well. 1011 Overall, the correlation of the HOME index with IQ 
for white mothers was +.24, statistically significant but hardly over- 
powering. 

In trying to identify children at risk, this way of looking at the rela- 
tionship is not necessarily the most revealing. We are willing to assume 
that a child growing up in a home at the 90th centile on the HOME in¬ 
dex has a “better” environment than one growing up at the 50th. Per- 
liaps the difterenee berween a terrific home environment and a merely 
average one helps produce children who are at the high end on various 
Personality and achievement measures. Rut it does not necessarily fol- 
low that the child in the home at the 50th centile is that much more at 
risk for the worst outcomes of malparenting than the child at rhe 90th 
centile. Both common sense and much of the scholarly work on child 
development suggest that children are resilient in the face of moderate 
disadvantages and obstacles and, on the other hand, that parents are 
frustrat ingly unable to fine-tune good resulrs for their children. 

Rut resilience has its limits. Children coming from the least nurtur- 
ing, most punishing environments are indeed at risk. We will therefore 
focus throughout this section on children who are in the bottom 10 per- 
cent on various measures of rheir homes. i(vl 


Which White Children Grow 

Up in the Worst Homes? 

Cognitive Class of 

Percentage of 

Their Children Growing 

Up in Homes in the Bottom 

the Mother 

Decile of the HOME Index 

I Very bright 

0 

11 Bright 

2 

III Normal 

6 

IV Oull 

11 

V Very Juli 

24 

All whires 

6 
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In rhe case of the HOME index, the percentages of white children of 
mothers in the different cognitive classes who are growing up in homes 
that scored at the hottom are displayed in the fahle. It was extremely 
rare for chiidren of women in the top cognitive classes to grow up in 
these “worst homes” and quite uncommon for children of women 
throughout the top three-fourths of the lQdistribution. Only in the bot- 
tom cognitive Hasses did the proportion of such children grow, and then 
the proportions rose rapidly. Nearly one out of four of the children of 
the dullest mothers was growing up in a home that also ranked in the 
hottom decile on the HOME index. 1631 

The Rate of Socioeconomic Background 

The usual assumption about maternal behavior is that a woman’s so- 
cioeconomic Status is crucial—that she passes on ro her children the 
henefits or disadvantages of her own family Background. The figure he' 
low summarizes the Standard analysi comparing SES and IQ. 

A white mother’s IQ is more important than her socioeconomic 
background in predicting the worst home environments 

Probability of being in the bottom decile of the HOME index 

20 %- 



Note: For compuring the ploc, age and cither SES (tor the black curve) or IQ (tor rhe ^ray 
curve) wert ser ar their mean valucs. Addirional independenr variables were used to conrrol 
tor the tesr year and the age of the children. 


Both factors play a significanr role, but once again it is worse (at least 
for the white NLSY population) to have a mother with a low IQ than 
one from a low socioeconomic background. Grven just an average IQ 
for the mother, even a mother at the 2d centile t)n socioeconomic back- 
ground had less than a 10 percenr chance of providing one of the “worst 
homes” for her children. Bur even with average socioeconomic back- 
ground, a mother at the 2d centile of Intelligence had almosr a 17 per- 
cent chance of providing one of rhesc “worst homes.” 

The Role of Piwerty and Welfare 

Many of the prohlems experienced by poor children are usually attrib- 
uted in both public dialogue and academic writings to poverty itself. 64 
The reasons for this widely assumed link between poverty and deveb 
opmental probiems are harder to spell out than you might rhink. To re' 
peat a point that must always he kept in mind when thinking about 
poverty: Most of the world’s children throughout history have grown up 
poor, with “poverty” meaning material deprivation far more severe than 
rhe meaning of “helow the poverty line” in today’s America. Many of 
the disadvantages today’s children experience are not the poverty itself 
but the Contemporary correlates of poverty: being without a father, for 
example, or living in high'Crime neighborhoods. Today, high propor¬ 
tions of poor children experience these correlates; fifty years ago, com- 
paratively few poor children did. 

But the re are reasons ro rhink that the HOME index might be influ- 
enced by poverty. Reading to children is a good thing to do, for exam¬ 
ple, and raises the HOME score, but children’s books are expensive. It: 
is easier to have books in the house if you can afford to buy them than 
if you have to trek to the library—perhaps quite far from home—to get 
them. Similar comments apply to many of the indicators on the HOME 
index that do not require wealth but could be affected by very low in- 
come. We therefore explored how the HOME index was related to the 
mothers poverty or welfare recipiency in the ealendar year before the 
HOME score was ohtained. 163 * 

Poverty proved to be important, with “being Ln a state of poverty” 
raising the odds of being in the worst decile of the HOME index from 
4 percent to 11 percent, given a mother of average IQ and socioeco¬ 
nomic Status. 1061 But adding poverty to the equation did not diminish 
the independent role of cognitive ability. For example, if the mother 
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had very low IQ (the 2d centile) and was in poverty, the odds of being 
in the worst decile on the HOME index jumped from 11 percent to 26 
percent. Generally, adding poverty to the analysis replaced the impact 
of the mothers socioeconomic background, not of her intelligence. 

Then we turn to welfare. The hypothesis is that going on welfare sig- 
nifies personality characteristics other than IQ that are likely to make 
the home environment deficient—irresponsibility, immaturity, or lack 
of initiative, for example. Therefore, the worst homes on the HOME 
index will also tend to be welfare homes. This hypothesis too is bome 
out by the data: Welfare recipiency was a slighrly more powerful 
predictor of being in a “worse home” than poverty—but it had as little 
effect on the independent role of IQ. 

In trying to decide among competing explanations, the simplest thmg 
to do is to enter both poverty and welfare in the analysis and see which 
wins out. We summarize the outcome by first considering a child whose 
mother is of average intelligence and socioeconomic background. lf his 
mother is either poor or on welfare (but not both), the odds of havmg 
a tertible home environment (hottom decile on the HOME index) are 
8 or 9 percent. If the mother has an IQ of 70, the odds shoot up to 18 
to 21 percent. If the mother has very low intelligence, is poor, and is 
also on welfare, the odds rise further, to 34 percent. A table with some 
of the basic permutations is given in the note. 16 ' 1 

Still, many of the causal issues remain unresolved. The task for schob 
ars is to specify what it is about poverty that leads to the outcomes as- 
sociated with ir. With the data at hand, we cannot go much further in 
distinguishing herween the effects of lack of money and the effects ot 
other things that “being in poverty” signifies. In particular, the way that 
poverty and welfare internet in producing a poor home environment 
provides many hints that need to be followed up. 

What can be said unequivocally is that low income as such does not 
prevent children from being raised in a stimulating, nurturing environ¬ 
ment. Such is the story of the regression coefficients, and a conclusion 
that accords with child rearing throughout history. By the same token, 
it does not take a genius to provide a child with a stimulating, nurtur¬ 
ing environment. The average differences in environment across the 
cognitive classes are large and in many ways troubling, hur, in percent- 
age terms, they explain little of the variance. Abundant examples of ex- 
cellent parents may be found through all but the very lowest ränge of 
cognitive ability. 


The Role of Educadon 

We conclude, as usual, by considering the role of education through the 
high school graduate and College graduate subsamples. Holding mater- 
nal age and the mothers socioeconomic background constant ar their 
means, College graduates tend to do well, no matter what their cogni¬ 
tive ability (within their restricted ränge), even though cognitive abil- 
ity retains a Statistical ly significant re lat ionship. Within the high school 
sample, the effects of cognitive ability are plain; the odds of being in the 
bottom decile on the HOME index for the child of a mother of average 
socioeconomic hackground drop from 15 percent for a high school grad¬ 
uate ar rhe 2d IQ centile to 5 percent for a comparahle person at the 
98th IQ centile. As in the earlier analyses, the mosr important impact 
of cognitive ability within the high school graduates seems to be at the 
low end. Socioeconomic hackground also continues ro play an impor¬ 
tant independent role, but Iess than IQ. 

DEVELOPMENTAL OUTCOMES 

The NLSY also administered batteries of rests regarding rhe develop- 
mental outcomes for the children of NLSY mothers. We review several 
indicators hriefly, then present a summary index showing rhe interrela- 
tionships the mother’s cognitive ability, socioeconomic background, 
poverty, and welfare. 


Some More Complications 


The HOME invenrory has rwo components—a Cognitive Stimulation 
score and an Emotional Support score—both adapted to rhree separate age 
grnups (under 3, 3 ro 5, and over 5 years of age). We conducted a variety 
of analyses to explore the subtests’ roles for different age groups. Briefly, 
the mother s IQ had the dominant role in determining the Emotional Sup- 
port score for children through the age of 5, whereas its role in determin¬ 
ing Cognitive Stimularion was roughly coequal with education and 
socioeconomic hackground—the opposite of what one mighr predict. Ma¬ 
temal IQ was especially important for Emotional Support to the 3- ro 5- 
year-old group. It would be worthwhile for investigators to explore with 
other data the NLSY’s indications that parental IQ is especially important 
for the home environment from ages 3 to 3, and the peculiar finding that 
parental IQ is more important for Emotional Support than for Cognitive 
Stimulation. 
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Temperament in Very Young Children 

The first of the measures applies to very young children (12 to 23 
months), and consists of indexes of “difficulty” and “friendliness.” Once 
again we focus on children who exhibit the most conspicuous signs of 
having problems—those in the bottom decile—as shown in the tob 
lowing table. 168 * Generally, babies were more “difficult”—more irritable, 
more fearful, and less sociable—for mothers with lower cognirive abib 
ity, and they were also less friendly, as measured by this index. 


Which White Toddlers Have the Worst Temperaments? 

Percentage of 


Percentage of 

Children in the Most 


Children in the Least 

Difficult Decile on 


Friendly Decile on 

the Difficulty 

Cognitive Class 

the Friendliness 

Index 

of the Mother 

Index 

— 

1 Very bright 

— 

4 

11 Bright 

3 

8 

III Normal 

5 

14 

IV Dull 

11 

— 

V Very dull 

12 

8 

All whites 

6 


Motor and Social Development in Infancs and Toddlers 

Motor and social development is, in effect., a set ot measures designed 
to capture whether the child is progressing in the ways described as non 
mal in the baby manuals by Spock, Brazelton, et al. The table below 
shows the results for children through the age of 3. The results look like 
a U-shaped curve, with a big jump in Class V. Since sample sizes in both 
Class I and Class V are under 100 (75 and 81, respectively), this Infor¬ 
mation falls in the category of interesting but uncertain. 

Behavioral Problems in Older Children 

For older children, the NLSY employed an instrument that measured 
behavioral problems, with subscales on antisocial behavior, depression, 


Which White Children Are Behind in 
Motor and Social Development? 

Cognitive Class 

Percentage of Children 
in the Bottom Decile 
of the Motor & Social 

of the Mother 

Development Index 

I Very bright 

10 

II Bright 

5 

III Normal 

6 

IV Dull 

10 

V Very dull 

32 

All whites 

7 


headstrongness, hyperactivity, immature dependency, and peer con- 
flict/social withdrawab The table below shows the results for those who 
had the most severe problems—those in the worst 10 percent on these 
measures. 


Which White Children Have the 

Worst Behavioral Problems? 

Cognitive Class 

Percentage of Children 
in the Worst Decile of the 

ofthe Mother 

Behavioral Problems Index 

1 Very bright 

ii 

11 Bright 

6 

III Normal 

10 

IV Dull 

12 

V Very dull 

21 

All whites 

10 


Once again, there is the curious case of the elevated percentage for 
children of mothers in Class I. The most prudent assumption is that it 
is an artifact of small sample sizes, but the possibility remains that some- 
thing eise is going on worth investigating in greater detail, with larger 
samples. 

An Index of Developmental Problems 

Fach of the developmental indexes we have reviewed is based on a nunv 
her of individual items, which in tum lend themselves to a wide variety 
of analyses that would take us far beyond the scope of this discussion. 
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We conducted many analyses for che separate Indexes, hut che overall 
patlerns were similar. For our purposes in conveying to you the general 
pattem of res ul cs, it is sufficient Co summarize che results for a hmad 
question: What independent role, if any, does the morher’s IQ have on 
the probabilicy thac her child experienc.es a suhstantial developmental 
prohlem? We created a simple “developmental problem index,” in 
which the child scores Yes if he or she were in che bottom decile ot any 
ot the tour indieators in a given test year, and No it not. The results ave 
shown in the next figure. 


Both a white mother’s IQ and socioeconomic 
background have moderate relationships with 
the developmental problems in the child 

Probability ofhaving a child in the bottom decile 

on one of the developmental indieators 

20 # - 

As the mother's IQ 



m |-,- 1 - 1 -r 

Very low Very high 

(-2 SDs) (+2 SDs ) 


The pattem shown in the figure generally applies to the tour devel- 
opinent indieators separately: IQ has a somewhar [arger independent eh 
fect th;tn socioeconomic background, hut of modest size and marginal 
Statistical significance. 


The Role oj Poverty , Welf are, and Illegitirnacy 

We repeated the analyses adding a poverty variable (Was the mother 
living in poverty in the year the developmental measures were taken?)* 
a welfare variable (Was the mother on AFDC in the year the de- 
velopmental measures were taken?), and legitimacy variable (Was 
the child born outside marriage?) When entered separately or in 
combina on, each had a statistically significant independent role. 1691 
Consider the stark contraut between a child horn ro an unmarried 
mother, on welfare and in poverty, and a child born to a married 
mother, not on welfare and ahove the poverty line. Given a mother 
with average IQ and socioeconomic background, the chances that the 
first child had a suhstantial developmental problem were almost twice 
as high as those tacing the second child—IS percent compared to 8 
percent. Rut taking rhese factors into accounr did not wipe out the in¬ 
dependent rede of either IQ or the mother s socioeconomic back- 
ground; in fact, the independent etieers oi IQ md socioeconomic 
background alter extraeting the independent role of poverty, illegiti- 
macy, and welfare, is visually almost indistinguishable from the one 
shown ahove. 

The Role of Education 

Analyses of the College graduate sample were provocative hut no more 
than that, hecause only 29 out of 470 chiLclren of white College gradu- 
ates vvho were tested (6 percent) showed up with a suhstantial devel- 
opmenral problem. The provocative finding was that among those 29, 
5 were ehildren of women in dass 1(10 percent of the 50 such children 
tested). Thus in the College sample, the Statistical resuh of Holding so- 
cioeconomic background constanr was that higher IQ was associated 
with a Suhstantial ly higher probability of having developmental Prob¬ 
lems. Five our of SO is ot course not enough to make much of these num- 
bers, hut we commend the finding to our eolleagues vvho specialize in 
child development. 

Within the high schon! sample, the independent roles ot IQ and 
socioeconomic background were almosr identieal, and ot the same 
Order of magnitude indicated in rhe figure for the enrire whire 
sample. 
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THE COGNITIVE OUTCOME 

We finally come to the intelligente of the children of white NLSY 
women. The measure of intelligente we shall he using is the Peahody 
Picture Vocabulary Test (PPVT), a widely used measure of cognitive 
ability in children that has the advantage of not requiring that the child 
he able to read. The scores for the NLSY children are expressed in terms 
of the national norms for the PPVT, whieh use a mean of 100 and a Stan¬ 
dard deviation of 15. Because IQ scores tend to be volatile for children 
under the age of 6, we limit the sample to children who wert at least 6 
when they took the test. 

The unsuiprising news in the next table is that the children tend to 
resemble their mothers in IQ.'° But by continuing to use the “worst 


IQ in the Mother and the Child for Whites in the NLSY 


Cognitive Class 

Mean IQ of 

Percentage of Their 
Children in the 

of the Mothers 

Their Children 

Bottom Decile of IQ 

1 Very bright 

— 

-- 

11 Bright 

107 

7 

III Normal 

100 

6 

iv null 

95 

17 

V Very du» 

81 

59 

All whites 

99 

10 


decile” as a way of zeroing in on the children most at risk, the table 
makes another point: White parents throughout the top three-quarters 
of the IQ distribution have few children who fall into the bottom decile 
of IQ.*' 11 For mothers in the bottom quarter of the distribution, however, 
the proportion of low IQ children rises precipitously. We return to this 
issue in Chapter 15. 

The Role of Socioeconomic Background 

Consistent with the conclusions drawn in a large technical literature, 
the IQ of the NLSY mothers was much more important than their so¬ 
cioeconomic background in determining their children s IQ. 1 '“ 1 A white 
child’s IQ in the NLSY sample went up hy 6.3 IQ points for each in- 
crease of one Standard deviation in the mothers IQ, compared to 1.7 
points for each increase of one Standard deviation in the mother’s so- 


cioeconomic background (in an analysis that also extracted the effects 
of the mothers age, the tesr year, and the age of the child when t.ested). 
When we examine the probability that the child will fall in the bottom 
decile of IQ, we arrived at the results shown in the next figure. 

A white mother’s IQ dominates the importance of socioeconomic 
background in determining the child’s IQ 


Probability of having a child in the bottom decile of IQ 
30 % - 



i-2SDs) (+2 SOs) 

Note: For compurmg rhv plot, age and eirher SES (for the Mack curve) nr IQ (for che jjroy 
eurve) wert 1 set ar their mean values. Additional mdepeMulenr variables were used toconrroi 
tor rhe tost year and the age ot rhe children when rhey rook the rest. 

A mother ar rhe 2d IQ centile but of average socioeconomic back¬ 
ground had a $0 percent chance that her child would be in the bottom 
decile of IQ, compared to only a 10 percent chance facing the woman 
from an equivalently terrible socioeconomic background (2d centile on 
the SES index) but with an average IQ. 

The Role of Poverty and the Home Environment 

In discussions of IQ among disadvantaged groups, ir seems plausible that 
factors such as poverty and the aspeers of the home environment would 
have an effect on the child’s IQ. Suppose, for example, we were to ig- 
nore rhe mother’s IQ, and look only at her socioeconomic background, 
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her poverty Status in the year before her child was tested, and her HOME 
index score. In that case, we could document the conventional wisdom; 
both socioeconomic background and the home environment have large 
effects on whether a child scores in the bottom IQ decile. Poverty has 
a smaller and statistically marginal effect. But when we add the mother’s 
IQ, all o i those other effects become both small in magnitude and sta¬ 
tistically insignificant. After taking socioeconomic background, the 
HOME index, and pretest poverty into account, the independent eff 
fect of IQ remains virtually identical to the one shown on the preced- 
ing figure. 

The Role of Education 

None of the children in the bottom decile of IQ had a mother wirb a 
bachelor s degree. In the high school graduate sample, the independent 
role of the mother’s IQ remains large and the independent role of so- 
cioeconomic background remains small. But in the process of exploring 
this issue, we came upon an effect of education that is worth exploring: 
Women who did not complete high school were at much higher risk of 
producing children in the bottom decile of IQ than women in the high 
school sample (meaning a high school diploma and exactly 1 2 years of 
education), even after Controlling for mother’s IQ and socioeconomic 
background. Additional analyses did not clarity what this finding inight 
mean; we commend it to our colleagues for a fulffscale analysis. 


haviors, from prenatal negligence that leads to low hirth weight, to post¬ 
natal treatment of the child associated with neglect and abuse, to de- 
velopmental outcomes, to cognitive outcomes. 

On the other hand, these data provide little or no evidence that the 
smartest women make the best mothers. Children can flourish in a wide 
variety of environments that are merely okay. But some environments 
are so bad that no one can seriously dispute that they are bad, and even 
the most resilientchildren have diffieulty overcoming them. These truly 
disadvantaged homes are disproportionately associated with women at 
the low end of the intelligence distrihution, even after other contribut- 
ing faetors such as poverty and socioeconomic Status are taken into ac¬ 
count. 


THE ASYMMETRY OF GOOD AND BAD PARENTS 

Granting the many exceptions at the individual level, the relationship 
of cognitive ability to parenting is unmistakable. Some oi these analy¬ 
ses have involved measures that are arguable. Can we real ly he sure that 
the indicators of what constitutes a stimulating and nurturing environ¬ 
ment are not just reflectionsof the preferences of the upper middle dass? 
We hope our readers do not take this easy way out. If the indicators that 
were used in the studies we have reported are indeed ones that you find 
valid in your own beliefs about what children need, then the conclu- 
sion follows: Over the long run and in the broad perspective, based on 
your best understanding of the realities of child rearing, smart parents 
tend to be better parents. People with low cognitive ability tend to be 
worse parents. This conclusion holds for a wide ränge of parenting be- 




Chapter 11 

Crime 


Among che most firmly estahlished facts about crimnuil offenden is that their 
distnbutüm of IQ scores differs from that of the Population at large. Taking 
the scientific literature as a whole, criminal äffenden have average JQs of 
about 92, eight points below the mean . More serious or chronic offenders gen- 
erally have lower starres than more casual offenders. The rclationship of IQ 
to criminality is especially pronounced in the small fraction of the population, 
primarily young men, who constitute the chronic crimmals that account for a 
dislrroportionate amount of crime. Offenders who have been caught da not 
score mach lower, if at all, than those who are getting away with their crimes . 
Hokling socioeconnmic Status constant does little to explain away the rela¬ 
tionship between crime and cognitive ability. 

High intelligente also provides some protection against lapsing into crimi¬ 
nality for people who otherwise are at risk. Those who have grotim up in tur¬ 
bulent homes, have parents who were themselves criminal, or who have 
exhibited the childhood traits that presage crime are less likely to become crim- 
inals as culults ij they have high IQ. 

These findings from an extensive research literature are supported by the 
evidence from white males in the NLSY. Low IQ was a risk factor for crimi¬ 
nal behavior , whether criminality was measured by incarceratiun or by seif 
acknowledged crimes. The socioeconnmic background of the NLSTs tehtte 
males was a negligible risk factor once their cognitive ability was taken into 
account. 


C rime can cear a free society apart, because free societies depend so 
crucially on faith that the other person will behave decently. As 
crime grows, society must Substitute coercion tor cooperation. The first 
casualty is not just freedom but the bonds that make community Ute at- 
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tractive. Yes, it is always possible to buy better locks, stay oft tfte streets 
after dark, regard every stranger suspiciously, post security guards every- 
where, but these are poor subsritutes for living in a peaceful and safe 
neighborhood. 

Most Americans think that crime has gotten far too high. But in 
the ruminations about how the nation has reached this state and what 
mighr be done, too little attention has been given to one ot the best' 
documented relationships in the study of crime: As a group, crinnnals 
are below average in intelligence. 

As with so many of the other problems discussed in the previous six 
chapters, things were not always so bad. Good crime statistics do not go 
back very far in the United States, but we do not need statistics to re- 
mind Americans alive in the 1990s of t.imes when they feit secure walk- 
ing late at night, alone, even in poor neighhorhoods and even in the 
nation’s largest cities. In the mid- 1960s, crime took a conspicuous turn 
for the worse. The overall picture using the official statistics is shovvn in 
the figure below, expressed as multiples of the violent crime rare in 1950. 

The figure shows the kind of crime that worries most people most vis- 
cerally: violent crime, which consists of robbery, murder, aggravated as- 
sault, and rape. From 1950 through 1963, the rate for violent crime was 

The boom in violent crime after the 1950s 


Proportional change in number of violent 
crimes reported to the police (1950=1) 



1950 1960 1970 1980 1990 

Source; Uniform Crime Kefxnts, annual, Federal Bureau ot Investigation. 


almost flat, followed by an extremely rapid rise from 1964 to 1971, fob 
lowed by continued increases until the 1980s. The early 1980s saw an 
interlude in which violent crime decreased noriceably. Bur the trend- 
line for 1985-1992 is even steeper than the one for 1963-1980, mak- 
ing ir look as if the lull was just that—a brief respite from an increase in 
violent crime that is now thirty years old. |1J 

There is still some argument among the experts about whether the 
numhers in the graph, drawn from the FBl’s Uniform Crime Reports, 
mean what they seem to mean. But the disagreement has limits. Draw- 
ing on sophisticated analyses of these numbers, the consensus con- 
clusions are that victimization studies, hased on Interviews of crime 
victims and therefore including crimes not reported to the police, 
indicare that the increase in the total ränge of crimes since 1973 
bas not been as great as the official statistics suggest, but that the 
increase reflected in the official statistics is also real, capturing changes 
in crimes that people consider serious enough to warrant reporting to 
the police. 121 

DEPRAVED OR DEPRIVED? 

The juvenile delinquents in Leonard Bernsteins West Side Story teil Of- 
licer Krupke that they are “depraved on account of we’re deprived,” 
showing an astute grasp of the poles in criminologieal theory: the psy- 
chological and the sociological.^ Are criminals psychologically dis- 
tinct ? Or are they ordinary people responding to social and economic 
circumstances? 

Theories of criminal behavior were mostly near the sociological pole 
from the 1950s through the 1970s. Its leading scholars saw criminals as 
much like the rest of us, except that soeiety eannarks thern for a life of 
criminality. Some of rhese scholars went further, seeing criminals as free 
of personal blame, evening up the score with a soeiety that has victim- 
ized them. The most radical theorists from the sociological pole argued 
ihat the definition of crime was in irselt ideological, creating “criminals” 
of people who were doing nothing more than behaving in ways that the 
power structure chose to define as deviant. In their more moderate 
forms, sociological explanarions continue to dominate public discourse. 
Many people take it for granted, for example, that poverty and unenv 
ployment cause crime—classic sociological arguments that are distin- 
guished more by their popularify than by evrdence. 4 
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Theories nearer ehe psychological pole were more common earlier 
in the history of criminology and have lately regained acceptance 
among experts. Here, the emphasis shifts to the characteristics of rhe 
offender rather than to his circumstances. The idea is that criminals 
are distinctive in psychological (perhaps even hiological) ways. They 
are deficient, depending on the particular theory, in conscience or in 
self-restraint. They lack normal attachment to the mores of their cub 
ture, or they are peculiarly indifferent to the feelings or the good opin- 
ion of ofhers. They are overendowed with restiess energy or with a 
hunger for adventure or danger. ln a term that was in common use 
throughout the nineteenth and early twentieth centuries, chronic ob 
fenders may he suffering from “moral insanity.” 1 In other okbfashioned 
vocahularies, they have heen called inhumane, atavistic, demented, 
monstrous, or hestial—all words that depict certain individuals as 
something less than human. In their most extreme form, psychologi¬ 
cal theories say that some people are born criminal, destined by their 
hiological makeup to offend. 

We are at neither of these theoretical poles. Like almost all other 
students of crime, we expect to find explanations from both sociology 
and psychology. The reason for calling attention to the contrast he- 
tween the theories is that public discussion has lagged; it remains more 
nearly stuck at the sociological pole in public discourse than it is among 
experts. In this chapter, we are interested in the role that cognirive 
ahility plays in creating criminal offenders. This by no means requires 
us to deny that sociology, economics, and public policy might play an 
important part in shaping crime rares. On rhe contrary, we assume that 
changes in those domains are likely to interact with personal charac¬ 
teristics. 

Among the arguments often made against the claim that criminals 
are psychologically distinctive, two are arguments in principle rather 
than in fact. We will comment on these two first, because they do not 
require any extensive review of the factual evidence. 

Argument 1: Crime rates have changed in recent times more than 
people’s cognitive ahility or personalities could have. We must there- 
fore find the reason for the rising crime rates in people’s changing eir- 
cumstances. 


When crime is changing quickly, it seems hard to blame changing 
personal characteristics rather than changing social conditions. But 
hear in mind that personal characteristics need not change everywhere 
in society for crime’s aggregate level in society to change. Consicler age, 
for example, since crime is mainly the business of young people hetween 
15 and 24. h When the age distrihution of the population shifts toward 
more people in their peak years for crime, the average level of crime 
may he expected to rise. Or crime may rise disproportionately if a large 
hulge in the youthful sector of the population fosters a youth culture 
that relishes unconventionality over traditional adult values. The ex- 
ploding crime rate of the 1960s is, for example, partly explained hy the 
baby boomers’ reaching adolescenced Or suppose that a style of child 
rearing sweeps the country, and it turns out that this style of child rear- 
ing leads to less control over the behavior of rehellious adolescenrs. The 
change in style of child rearing may predicrably he followed, fifreen or 
so years later, hy a change in crime rates. If, in short, circumstances tip 
toward crime, the change will show up most among those with the 
srrongest rendencies to break laws (or the weakest rendencies to obey 
thein). 1 * 1 Understanding those tendencies is the business of theories ar 
the psychological pole. 

Argument 2: Behavior is criminal only because society says so. 

There cannot he psychological tendencies to engage in behavior de- 

fined so arbitrarily. 

This argument, made frequently during the 1960s and 1970s and al- 
ways more populär among intellectuals than with the general public, is 
heard most often opposing any Suggestion that criminal behavior has 
hiological roots. How can something so arbitrary, say, as not paying one’s 
taxes or driving above a 55 mph speed limit he inherited? the critics ask. 
Behavior regarding taxes and speed limits certainly cannot he coded in 
our DNA; perhaps even more elemental behaviors such as robbery and 
murder cannot either. 

Our counterargument goes like this: Instead of crime, consider be- 
havior that is less controversial and even more arbitrary, like playing the 
violin. A violin is a cultural artifact, no less arbitrary than any other 
man-made object, and so is the musical scale. Yet few people would ar- 
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gue that the firsr violinists in the nations great orchestras are a random 
sample of the population. The interests, talents, self-discipline, and ded- 
ication that it takes to reach their [evel of accomplishment have roots 
in individual psychology—quite possibly even in biology. The Variation 
across people in any behavior, however arbitrary, will have such roots. 
To that we may add that the core crimes represented in the violent crime 
and property crime indexes—murder, robbery, and assault—are really 
not so arbitrary, unless the moral codes ot human cultures throughout 
the worid may bc said to be consistently arbitrary in pretty much the 
same way throughout recorded human history. 

But even if crime is admitted to be a psychological phenomenon, why 
should intelligence be important? What is the logic that might lead us 
to expect low intelligence to be more frequently linked with criminal 
tendencies than high intelligence is? 

One chain of reasoning Starts from the observation that low intelli- 
ge nee often translates into failure and frustration in school and in the 
job market, lf, for example, people of low intelligence have a hard time 
finding a job, rhey might have more reason to commit crimes as a way 
of making a living. lf people of low intelligence have a hard time ae- 
quiring Status through the ordinary ways, crime might seem like a good 
alternative route. At the least, their failures in school and ar work may 
fester resentment toward society and its laws. 

Perhaps the link between crime and low IQ is even more direct. A 
lack of foresight, which is often associated with low IQ, raises the at- 
tractions of the immediate gains from crime and lowers the strength of 
the deterrents, which come later (if they come at all). To a person of 
low intelligence, the threats of apprehension and prison may lade to 
meaninglessness. They are too abstract, too far in the luture, too uiv 
certain. 

Low IQ may be part of a broader complex of factors. An appetite fcor 
danger, a stronger-than-average hunger for the things that you can get 
only by srealing if you cannot buy them, an antipathy toward conven- 
tionality, an insensitivity to pain or to social ostracism, and a host of de- 
rangements of various sorts, combined with low IQ, may set the stage 
for a criminal career. 

Finally, there are moral considerations. Perhaps the ethical princi- 
ples for not eommitting crimes are less accessible (or less persuasive) to 
people of low intelligence. They find it harder to understand why roh- 
hing someone is wrong, find it harder to appreciate the values of civil 


and cooperative social life, and are accordingly less inhibited from act- 
ing in ways that are hurtful to other people and to the community at 
large. 

With these preliminaries in mind, let us explore the thesis that, what- 
ever the underlying reasons might be, the people who lapse into crimi¬ 
nal behavior are distinguishable from the population at large in their 
distribution of intelligence. 

THE LINK BETWEEN COGNITIVE ABILITY AND CRIMINAL 
BEHAVIOR: AN OVERVIEW 

The Statistical association between crime and cognitive ability has been 
known since intelligence testing began in eamest. The British physi- 
cian Charles üoring mentioned a lack of intelligence as one of the dis- 
tinguishing traits of the prison population that he described in a 
landmark contribution to modern criminology early in the Century. 10 In 
1914, H. H. C3oddard, an early leader in both modern criminology and 
the use of intelligence tests, concluded that a large fraction of convicts 
were intellectually subnormal. 11 

The subsequent history of the study of the link between IQ and crime 
replays the larger story of intelligence testing, with the main difference 
being that the attack on the IQ/crime Unk began earlier than the 
broader attempt to discredit IQ tests. Even in the 1920s, the link was 
called into question, for example, by psychologist Carl Murchison, who 
produced data showing that the prisoners of Leavenworth had a higher 
mean IQ than that of enlisted men in World War I. [U| Then in 1931, 
Edwin Sutherland, Americas most prominent criminologist, wrote 
“Mental Deficiency and Crime,” an article that effectively put an end 
to the study of IQ and crime for half a Century. M Observing (accurately) 
that the ostensible IQ differences between criminals and the general 
population were diminishing as testing procedures improved, Suther¬ 
land leaped to the conclusion that the remaining differences would dis- 
appear altogether as the state of the art improved. 

The difference, in fact, did not disappear, but that did not stop crim¬ 
inology from denying the importance of IQ as a predictor of criminal 
behavior. For decades, criminologists who followed Sutherland argued 
that the IQ numbers said nothing about a real difference in intelli¬ 
gence between offenders and nonoffenders. They were skeptical about 
whether the convicts in prisons were truly representative of offenders 
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in general, and they disparaged the tests’ validity. Weren’t tests just mea- 
suring socioeconomic Status by other means, and weren’t they biased 
against the people from the lower socioeconomic classes or rhe minor- 
ity groups who were most likely to break the law for other reasons? they 
asked. By the 1960s, the association between intelligence and crime was 
altogether dismissed in criminology textbooks, and so it remained ui> 
til recently. By the end of the 1970s, students taking introductory 
courses in criminology could read in one widely used textbook that the 
belief in a correlation between low intelligence and crime “has almost 
disappeared in recent years as a consequence of more cogent research 
findings,” 14 or learn from another Standard textbook of “the practical 
abandonment of feeblemindedness as a cause of crime. 

It took two of the leading criminologists of another generat ion, Travis 
Hirschi and Michael Hindelang, to resurrect the study of IQ and crim- 
inality that Sutherland had buried. In their 1977 article, “Intelligence 
and Delinquency: A Revisionist View,” they reviewed many studies that 
included IQ measures, took into account the potential artifacts, and 
concluded that juvenile delinquents were in fact characterized by sub- 
stantially belovv-average levels of tested intelligence. 16 Hirschi and Hin¬ 
delangs werk took a while to percolate through the academy (the aurhor 
of the 1982 edition of one of the textbooks quoted above continued to 
make no mention whatever of IQ), 1 ' but by the end of the 1980s, most 
criminologists accepted not just that an IQ gap separates offenders and 
nonoffenders, but that the gap is genuinely a difference in average in- 
tellectual level or, as it is sometimes euphemistically called, “academic 
competence.” Criminology textbooks now routinely report the correla¬ 
tion between crime and intelligence, and although some questions of 
interpretation are still open, they are narrower than they used to be be- 
cause the correlation itself is no longer in dispute. 1161 

The Size of the IQ Gap 

How big is the difference between criminals and the rest of us? Taking 
the literature as a whole, incarcerated offenders average an IQ of about 
92, 8 points below the mean. The population of nonoffenders averages 
more than 100 points; an informed guess puts the gap between of¬ 
fenders and nonoffenders at about 10 points. 1191 More serious or more 
chronic offenders generally have lower scores than more casual of¬ 
fenders. 1 ^ 1 The eventual relationship between IQ and repeat offend¬ 


ing is already presaged in IQ scores taken when the children are 4 
years old." 1 

Not only is there a gap in IQ between offenders and nonoffenders, 
but a disproportionately large fraction of all crime is committed by peo¬ 
ple toward the low end of the scale of intelligence. For example, in a 
twenty-year longitudinal study of over 500 hundred boys in an uniden- 
tified Swedish Community, 30 percent of all arrests of the men by the 
age of 50 were of the 6 percent with IQs helow 77 (at the age of 10) and 
80 percent were of those with IQs below 100. 22 However, it Stands to 
reason (and is supported by the data) that the population of offenders 
is short of very low-scoring persons—people whose scores are so low that 
they have trouble mustering the competence ro commit most crimes. 24 
A sufficiently low IQ is, in addition, usually enough to exempt a person 
from criminal prosecution. 24 

Do the Unintelligent Ones Commit More Crimes —or Just Get Caught 
More Ojten! 

Some critics continue to argue that offenders whose IQs we know 
are unrepresentative of the true criminal population; the smart ones 
presumably slipped through the net. Surely this is correct to some 
degree. If intelligence has anything to do with a persons general 
competence, then it is not implausible that smart criminals get arrested 
less offen because they pick safer crimes or because they execute their 
crimes more skillfully. [2S1 But how much of a bias does this introduce in¬ 
to rhe data? Is there a population of uncaught offenders with high 
IQs committing large numhers of crimes? The answer seems to be 
no. The crimes we can trace to the millions of offenders who do pass 
through the criminal justice System and whose IQs are known account 
for much of the crime around us, particularly the serious crime. There 
is no evidence for any other large population of offenders, and barely 
enough crime left unaccounted for to permit such a population’s exis- 
tence. 

In the small amount of data available, the IQs of uncaught offenders 
are not measurably different from the ones who get caught/ 6 Among 
those who have criminal records, there is still a significam negative cor¬ 
relation between IQ and frequency of offending. 27 Both of these kinds 
of evidence imply that differential arrests of people with varying IQs, 
assuming they exist, are a minor factor in the aggregate data. 
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Intelligence as a Preventative 

Looking at the upposite side of the picture, those who do nor commit 
crimes, it appears that high cognitivc ability protects a person from he- 
coming a criminal even if the other precursors are present. One study 
followed a sample of almost 1,500 boys born in Copenhagen, Denmark, 
between 1936 and 1938.'’* Sons whose fathers had a prison record were 
almost six times as likely to have a prison record themselves (by the age 
of 34— 36) as the sons ot men who had no police record of any sort. 
Among these high-risk sons, the ones who had no police record ar all 
had IQ scores one Standard deviation higher than the sons who had a 
police record.*"^ 

The protective power of elevated intelligence also shows up in a New 
Zealand study. Boys and girls were divided on the hasis of their behav- 
ior by the age of 5 into high and low risk for delinquency. High-risk chib 
dren were more than twice as likely to hec.ome delinquent by their 
mid-teens as low-ri.sk children. The high-risk boys or girls who did not 
become delinquent were the ones vvith the higher IQs. This was also 
true for the low-risk boys and girls: The nondelinquents had higher IQs 
than the delinquents/ 0 

Children growing up in troubled circumstances on Kauai in the 
Hawaiian chain confirm the pattem. Several hundred children were fol¬ 
lowed in a longitudinal study for several decadesT Some of the children 
were identif ied hy their second birthday as being statisrically “vulnera¬ 
ble” to behavioral disorders or delinquency. These were children suHer¬ 
ing from two or more of the following circumstances: they were being 
raised in troubled or impoverished families; had alcoholic, psychologi- 
cally disturbed, or unschooled (eighi years or less of schooling) parents; 
or had experienced prenatal or pennatal physiological stress. Two-thirds 
of these children succumbed to delinquency or other psychological dis- 
turbances. But how about the other third, the ones who grew up with- 
out hecoming delinquents or disturbed psychologically ! Prominent 
among the protective factors were higher inrellectual ability scores than 
the average for the vulnerable groupQ 

THE LINK BETWEEN COGN1TIVE ABILITY AND CRIMINAL 
BEHAVIOR: WHITE MEN IN THE NLSY 

In the United States, where crime and race have become so intertwined 
in the public mind, it is especially insrructive to focus on just whites. To 


The Rest of the Story 

The sratistically disringuishable personal characteristics of criminals go far 
beyond IQ. There is, for example, the enormous differente between the 
levels of male and temale criminality, which cannot be explained by in- 
tellecrual differences between the sexes. Accounts of the rapidly expand- 
ing literature on the psychological and hiological correlares of criminality, 
which has become highly informative about everything from genes to early 
childhood precursors, may be tracked in numerous scientific journals and 
books. Probahly as much could be learned about individual differences 
beyond intelligence that characterize the chronically unemployed, un- 
married mothers, neglectful parents, and others who have been the sub- 
jects of the other chaprers in Part II. But that is just surmise at this point. 
The necessary research has either not been done at all or has been done in 
only the sketchiest way. IMl 


siinplify matters, wc also limir the NLSY sample to males. Crime is still 
overwhelmingly a man’s vice. Among whites in the sample, 83 percent 
of all persons who admitted to a criminal conviction were male. 

butT/wring Self-Report Data 

In the 1980 interview wave, the members of the NLSY sample were 
asked detailed questions about their criminal activity and their in- 
volvement with the criminal justice system. These data are known as 
seif-report data, meaning that we have to go on what the respondent says. 
One ohvious advantage of self-reports is that they presumably include 
information about the crimes of offenders whether or not they have been 
caught. Another is that they circumvent any hiases in the criminal jus¬ 
tice system, which, some people argue, contaminate official criminal 
statistics. But can self-report data be trusted? Criminologists have ex- 
plored this question for many years, and the answer is yes, but only if 
the data are treated gingerly. Different racial groups have different re¬ 
sponse patterns, and these are compounded by differences between the 
genders. 1 ^ 51 Other issues are discussed in the note. ,Ul * 

Our use of the NLSY self-report data sidesteps some of the problems 
by limiting the analysis to one ethnic group and one gender: white 
males. Given the remaining problems with self-report data, we will con- 
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centrate in this analysis on events that are on the public record (and 
the respondent knows are on the public record): heing stopped by the 
police, formal charges, and convictions. In doing so, we are following a 
broad finding in crime research that official contacts with the law eiv 
forcement and criminal justice System are usefully accurate reflections 
of the underlying level of criminal activity. |37) At the end of the discus- 
sion, we show briefly that using self-report data on undetected crimes 
reinforces the conclusions drawn from the data on detected crimes. 

JQ and Types of Criminal Involvement 

The typical finding has been that between a third and a half of all ju¬ 
veniles are stopped by police at some time or another (a proportion that 
has grown over the last few decades) but that 5 to 7 percent of the pop- 
ulation account for about half the total number of arrestsG In the case 
of white males in the NLSY, 34 percent admitted having been stopped 
at some time by the police (for anything other than a minor traffic Vi¬ 
olation), but only 3 percent of all white males accounted for half of the 
self-reported “srops.” 

Something similar applies as we move up the ladderof criminal sever- 
ity. Only 18 percent of white males had ever formally been charged with 
an offense, and a litrle less than 3 percent of them accounted for half 
the charges. Only 13 percent of white males had ever been convicted 
of anything, and 2 percent accounted for half of the convictions. Based 
on these self-reports, a very small minority of white males had serious 
criminal records while they were in this 15 to 23 age ränge. 

Like studies using all races, the NLSY results for white males show a 
regulär relationship between IQ and criminality. The table below pro- 
senrs the average IQs of white males who had penetrated to varying lev- 
eis of the criminal justice System as of the 1980 interviewt Those who 


Criminality and IQ Among White Males 

Deepest Level of Contact with the 


Criminal Justice System 

Mean IQ 

None 

106 

Stopped hy the police hut not booked 

103 

Booked but not convicted 

101 

Convicted but not incarcerated 

100 

Senrenced to a correcrional facility 

93 


reported they had never even been stopped by the police (for anything 
other than a minor traffic violation) were ahove average in intelligence, 
with a mean IQ of 106, and things went downhill from there. Gose to 
a Standard deviation separated those who had never been stopped by 
the police from those who went to prison. 

A similar pattern emerges when the criminal involvements are sorted 
by cognitive dass, as shown in the next table. Involvement with the 
criminal justice System riscs as IQ falls from Classes I through IV. Then 


The Odds of Getting Involved with the Police 
and Courts for Young White Males 
Percentage Who in 1980 Reported Ever Having Been: 


Cognitive 

Stopped by 

Booked for 

Convicted of 

Sentenced to 

Class 

the Police 

an Offense 

an Offense 

lncarceration 

1 Very bright 

18 

5 

5 

0 

11 Bright 

27 

12 

7 

1 

III Normal 

37 

20 

15 

3 

IV Hüll 

46 

27 

21 

7 

V Very du 11 

33 

17 

14 

7 

Overall 

34 

18 

9 

3 


we reach Class V, with IQs under 73. If we take the responses at face 
value, the Class Vs are stopped, charged, and convicted at lower rates 
than the Class IVs but are senrenced to correcrional facilities at rates 
almost exactly the same rate. We noted earlier that people at the low- 
est. levels of intelligence are likely to he underrepresented in criminal 
statistics, and so it is in the NLSY. lt may be that the offenses of the 
Class Vs are less frequent hut more serious than those of the Class IVs 
or that they are less competent in getting favorahle treatment from rhe 
criminal jusrice sysrem. The data give us no way to teil. 

ln addition to selhreporrs, the NLSY provides data on criminal he~ 
havior by noting where the person was interviewed. ln all the interviews 
from 1979 to 1990, was the young man ever interviewed in a correc- 
tional facility ? The odds shown in the table below (computed from the 
unrounded results) that a white male had ever been interviewed in jail 
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The Odds of Doing Time for Young White Males 

Cognitive Class 

Percentage Ever 
Interviewed in a 
Correctional Facility 

I Very bright 

1 

11 Bright 

1 

111 Normal 

3 

IV Du 11 

7 

V Very dull 

12 

Overall 

3 


were fourteen times greater for Class V than for white rnales anywhere 
in the top quarrile of IQ. 

Being incarcerated at the time of the interview signifies not just 
breaking the law and serving time but also something ahout the dura- 
tion of the sentence, which may explain the large increase at the bot- 
rom of the ability distribution. The NLSY sample of white males echoes 
the scientific lirerature in general in showing a sizable IQ gap between 
offenJers and nonoffenders at eaeh level of involvement with the crinv 
inal justice System. 

The Role of Socioeconomic Background 

We will use both self-reports and whether the interviewee was incar¬ 
cerated at the time of the interview as measures of criminal behavior. 
The self-reports are from the NLSY men in 1980, when they were still 
in their teens or just out of them. It combines reports of misdemeanors, 
dmg offenses, property offenses, and violent offenses. Our definition of 
criminality here is that the inan’s description of his own behavior put 
him in the top decile of frequency of self-reported criminal activity. lv * 
The orher measure is whether the man was ever interviewed while be¬ 
ing confined in a correctional facility between 1979 and 1990. When 
we run our Standard analysis for these two different measures, we get the 
results in the next tigure. 

Both measures of criminality have weaknesses but different weak- 
nesses. One relies on self-reports but has the virtue of including uiv 
caught criminality; the other relies on the workings of the criminal 
justice system but has the virtue of identifying people who almost cer- 
tainly have commirted serious offenses. For both measures, after con- 


On two diverse measures of crime, the importance of IQ 
dominates socioeconomic background for white men 

The probability of meeting either of two criteria of criminality 
200 - 

Black lines: As IQ goes from low to high 

Gray lines: As parental SES gocs front low to high 



Mott 1 : i'or computin^ the plot, age anJ either ShS (for tfie hlack curve.s) or IQ (foi the t^ray 
eurves) were set at their mean values. 

troliing for IQ, the men’s socioeconomic background had little or noth¬ 
ing to do with crime. In the case of the self-report data, higher socioe- 
conomic Status was associated with higher reporred crime after 
Controlling for IQ. ln the case of inoarceration, the role ot socioeco- 
nomic background was dose to nil after conrrolling for IQ, and statis- 
tically insignificant. By either measure ot crime, a low IQ was a 
significant risk factor. 

The Role uf a Broken Home 

When people think about the causes of crime, they usually rhink not 
only of the role of juvenile delinquenr’s age and socioeconomic hack' 
ground but also of what used to be called “broken homes.’ 1t is now an 
inadequate phrase, because many families do not even begin with a mar- 
ried husband and wife, and many broken homes are reconstituted (in 
some sense) through remarriage. But wharever the specific way in which 
a home is not intact, the children of such families are usually more likely 
to get in trouble with the law than children from intact families. 41 This 
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was rrue for che NLSY white males. An intact family consisting of che 
hiological mother and tather was associated with hetter outeomes for 
their children than any of the other family arrangements. Was the young 
man ever stopped by the police? ThirtV'two percenr of white males from 
intact families compared to 46 percent of all others. Booked for an ob 
fense? Fifteen percent compared to 29 percent. Convicted of an offen so? 
Eleven percent compared to 21 percent. Senrenced to a oorrectional hv 
cility? Two percent compared to 7 percent. 

Although family setting had an impact on crime, it did not explain 
away the predictive power of IQ. For example, a young man from a bro- 
ken family and an average IQ and socioeconomic hackground had a 4 
percent chance of having been interviewed in jail. Switch his IQ to the 
2d centile, and the odds rise to 22 percent. (Switch his socioeconomic 
hackground to the 2d centile instead, and the odds rise on ly from 4 to 
5 percent .) The same conclusions apply to the measure of self-reported 
crime. 

The Role of Education 

Scholars have been arguing about the relationship of education to crime 
and delincjuency for many years without settling the issue. The case of 
the NLSY white males is a classic example. Of those who were ever uv 
terviewed in jail, 74 percent had not gotten a high school diploma. None 
had a College degree. Clearly somethingabout gettingseriously involved 
in crime competes with staying in school. Low IQ is part of that “stmnv 
thing” in many cases, but the relationship is so strong that other factors 
are prohahly involved—for example, the same youngster who is willing 
to burglarize a house prohahly is not the most obedient of pupils; the 
youngster who commits assaults on the Street prohahly gets in fights on 
the school grounds; the youngster who is undeterred by the prospect of 
jail time prohahly is not much motivatecl hy the prospect of getting a 
high school degree; and so forth. 

Does high school dropout actually cause the suhsequent crime ? Many 
people assumed so until Delbert Elliott and Harwin Voss published a 
study in 1974 that concluded the opposire: Crime diminished alter 
school dropout. 4 “ Since then, everyone has agreed that eventual 
dropouts tend to have high levels of criminal activity while they are in 
school, but disputes remain about whether the rates fall or rise alter the 
dropout occurs.^ 1 


For our purposes, it makes little sense to examine the continuing role 
of IQ in our usual educational samples when the action is so conspiciv 
ously concentrated among those who fall neither in the high school nor 
the College graduate samples. Running our Standard analysis on white 
males who did not get a high school diploma did not shed much more 
light on the matter. 144 ’ Given the restriction of ränge in the sample (the 
mean IQ of the white male dropout sample was 91, with a Standard de' 
viation of only 12.5), not much can he concluded from the fact that the 
ones at the very bottom of the cognitive abiliry disrribution were less 
likely to report high levels of criminal activity. For these school 
dropouts, the likelihood of having been interviewed in jail rose as IQ 
feil, but the relationship was weaker than for the unrestricted sample of 
white males. 

CRIME, COGNITIVE ABILITY, AND CONSC1ENCE 

By now, you will already be anticipating the usual caution: Despite the 
relationship of low IQ to criminality, the great majority of people with 
low cognitive ability are law ahiding. We will also take this opportunity 
to reiterate that the increase in crime over the last thirry years (like the 
increases in illegitimacy and weif are) cannot he attributed to changes 
in intelligence but rather must be blamed cm cstirer factors, which may 
have put people of low cognitive ability at greater risk than before. 

The caveats should not obscure the importance of the relationship 
of cognitive ability to crime, however. Many people tend to think of 
criminals as coming from the wrong side of the tracks. They are correcr, 
insofar as that is where people of low cognitive ability disproportiorv 
ately live. They are also correct insofar as people who live on the right 
side of the tracks—whether they are rieh or just steadily employed work- 
ing'dass people—seldom show up in the nation s prisons. But the as- 
sumption that too glibly follows from these ohservations is that the 
economic and social disadvantage is in itself rhe cause of criminal he' 
havior. That is not what the data say, however. ln trying to understand 
how to deal with the crime problem, much of the attention now given 
to prohlems of poverty and unemployment should be shifted to another 
question altogether: coping with cognitive disadvantage. We will return 
to this question in the final chapter, when we consider policy changes 
that might make it easier for everyone to live within the law. 



Chapter 12 

Civility and Citizenship 


A free society demands a ckizenry that willingly participates in the civic en¬ 
ter pnse , in matters as grarul as national elections and as commonplace as 
tieighborliness. Lacking this quality — civility, in its core meaning—a society 
must replace freedom unth coercion if it is to mamtain order. This chapter ex- 
amines the contribution of cogniüve ability tu the capacity for civility and cit¬ 
izenship. 

Most manifestations of civility are too fleeting to be measured and studied. 
One realm of activity that does leave measurable traces is political involve- 
ment, which mcludes both partkipatum in political activities and some knowl- 
edge and sophistication about them. 

For assessing any relationship between political irwolvement and IQ, the 
best data, surprisingly, are frorn Studies of children, and the results are am- 
sistent: Brighter children of all socioeconomic classes , including the poorest, 
leam more rapidly about politics and how govemment works , and are rnore 
likely than duller children to read about , cü’scuss, and participate in poiiricai 
activities. The gap between brighter and duller children in political develop¬ 
ment U’idem unth age, unhke the static gap across socioeconomic classes. 

For adults, the Standard theury of political inwk’emem for many years Juls 
assumed that socioeconomic Status is the vital link. People at highere tatus lev- 
els vote rnore, and they know and care more about political matters than du 
people at (ou>er [evels of Status. Bat the auailable research offers ampfc evi' 
dence that the key elementfor predicting political involvement is educatioruil 
level. The people who vote least and who care the leerst about political issues 
are not so much the poor as the uneducated, whatever their income or occu- 
pation. Why does educatüm matter so muchl The fragmentary studies avail- 
able indicate that education predicts political involvement in America because 
it is primanly a proxy for cognitive ability. 

The NLSY does not have the data for pursuing this manifestation of civil¬ 
ity, hut it permits us to explore another aspect of it: To what extern is high in- 
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te%ence a^sociared with the behaviors associated wich “middle-class values”? 
The ansiver is that the brighter young people of the NLSY are also che ones 
whose lives most resemble a .semetimes disdained stereotype: They stick with 
school, are pluggingaway in the workforce, and are loyal to their sporne, ln - 
sofar as Intelligence helps lead people to hehave in these ivays , it is also a force 
for maintaining a civil society. 

A mericas political System relies on the civility of its citizens—“ci¬ 
vility” not in the Contemporary sense of mere politeness hut ac- 
cording to an older meaning which a dictionary close at hand defines 
as “deference or allegiance to the social order befitting a Citizen.” 111 The 
wording of the definition is particularly apt in the American case. Ci¬ 
vility is not obedience hut racher “allegiance” and “deference”—vvords 
with old and honorahie meanings that are now largely lost. The ohject 
of these Sentiments is not the government hut a social order. And these 
things are required not of a suhject hut of a Citizen. Taken together, the 
elemenrs of civility imply hehavior that is hoth considered and consid- 
erate—-precisely the kind of hehavior that the Founders relied upon to 
sustain their crcation, though they would have heen more likely to use 
the word virtue rhan ck'iiiry. 121 

The point Ls that, given such civility, a free society as envisioned hy 
the Founders is possible. “Civihized” people do not need to he tightly 
constrained by laws or closely monitored hy the organs of state. Lack- 
ing such civility, they do, and society must over time hecome much less 
free. That is why civility was relevant to the Founders’ vision of a free 
society and also why it remains relevant today. In Part IV, we consider 
further the link hetween intelligence and the polity. At this point, we 
ask what the differenc.es are hetween people that explain whether they 
are civil. Specifically, what is the role of intelligence? 

Much of what could go under the heading of civility is not readily 
quanrified. Mowing the lawn in the summer or keeping the sidevvalks 
shoveled in the winter, maintaining a tolerable level of personal hygiene 
and grooming, returning a lost wallet, or visiting a sick friend are not 
entirely dictated hy fear oflawsuits orof retaliation from outraged neigh- 
bors. They likely have an element of social engagement, of caring about 
ones neighhors and community, of what we are calling civility. Most 
such everyday acts of civility are too fleeting to he caught in the net of 
ohservation that social science requires. 


Forrunately, the behaviors that go into civility tend to he of a piece, 
and some acts leave clear traces that can he aggregated and studied. In 
the precedirig chaprer, we examined one set of such behaviors, crime. 
Crime is important in irself, of course, hut it also captures the negative 
pole of disassoeiation from society at large and the community in par- 
ticular. Everything we know about the lives of most criminals suggests 
that in their off-dury hours they are not comtnonly shoveling the side- 
walk, visiting sick friends, or returning lost wallets—or doing the myr- 
iad other things that signify good neighhors and good citizens. In that 
light, the chapter on crime may he seen as a discussion of a growing in- 
civility in American life and the contrihution that low cognitive abil- 
ity makes to it. 

POLITICAL PARTICIPATION AS AN OUTCROPPING OF 
CIVILITY 

Political participation is not the thing-iivitself of civility. Most of us can 
recall acquaintances who show up reliahly at town council meetings and 
are hectoring, opinionated, and generally destructive of community life. 
But, as always, we are talking about Statistical tendencies, and for that 
purpose political participation is not a bad indirect measure. 

Consider the act of voting. We have friends, conscientious in many 
ways, who do not vote and who even look at us, registering and voting, 
offen at some inconvenience, with hemused superiority. They point out 
wich indisputahle accuracy that our hallots account for less than a mil- 
Uonth of the overall outcome of most statewide elections, not to men- 
tion national ones, and that nt) major political contest in United States 
history has ever heen decided by a single vote. 111 Are we hehaving irra- 
tionally hy voting? 14 ' 

Not if we value civility. ln thinking about what it means to vote, a 
passage in Aristotle’s Polilics comes to mind. “Man is by nature a polit¬ 
ical animal,” Aristotle wrote, “and he who hy nature and not by mere 
accident is without a state, is either a bad man or above humanity; he 
is like the ‘tribeless, lawless, hearthless one,’ whom Homer denounces.’” 
The polling place is a sort of civic hearth. ln the aggregate (though not 
always in every instance) those who do not vote, or who vote less con- 
sistently, are weaker in this manifestation of civility than those who do 
vote consistently. Think inwardly ahout why you try to keep up with is- 
sues that affect your neighborhood or at least try to do some cramming 
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as an elecrion approaches, and why you usually manage to get to the 
polling place when the election arrives (or feel guilty when you do not). 
Are vve wrong to assume that the reasons have something to do with a 
consciousness of thc duties ot being a Citizen and good neighbor? 
Therein lies the modest claim we make here. There is norhing particu- 
larly virtuous or civil about being a political activist, hut the simpler 
ways in which we carry on the hasic political business of a democracy 
hetoken the larger artitudes that make up civiliry. 

DEVELOPING CIVILITY IN CHILDREN 

The connection between intelligence and political involvement has 
been more thoroughly studied tor children than for adults. ln part, this 
is he cause until recently schon Ls routinely gave IQ tests to children. 
With the children’s intelligence tesr scores as a baseline, social seien- 
tists could then study whatever variables they were interested in, such 
as political awareness or mterest. Besides being relatively easy to do, 
studies ot childhood political development circ.umvented some of the 
questions that arise with adults; children, for example, have no vested 
political or economic interests (beyond the approval of parents or otlv 
ers) to complicate the analysis of their responses. 

One major study assembled a sample of 12,000 children in grades 2 
through 8, from schools in middle- or working-class neighhorhoods in 
both large and small ciries in various regions ot the country in the early 
1960s N The child ren provided intormation about their fathers’ occu- 
pations and interest in politics. School records included IQ scores for 
ahout 85 percent of the children. The heart of the study was a serics of 
quesrions about the childrens level and ränge of political develop¬ 
ment. 1 '* They were, tor example, asked whether they knew which brauch 
of govemment enacted laws, whether they understood the duties of thc 
President and the courts, whether they ever read ahout politics in the 
newspapers or talked ahout it to their parents or friends, whether they 
feit that they were protected by the govemment or whether individu- 
als could exert any political influence on their own, whether they had 
ever wom campaign buttons or handed out leaflets for a candidate. 
Their attitudes about voting, about the duties of a Citizen, about polit¬ 
ical change, about legal punishmenr, among other things, were probed. 

The results were predictahle in many ways. Younger children tended 


to see the go vorn ment in terms of individuals (government - the cur¬ 
rent President) and as a fixed and absolute entity; older children were 
hetter informed, were more likely to think in terms ot institutions in- 
stead of individuals, and had a clearer sense ot the duties ot citizenship. 
The higher a child’s socioeconomic background, the more rapidly his 
political socialization proceeded. Among the dimensions most affected 
by socioeconomic Status—again, no surprise—was a child’s sense of po¬ 
litical efficacy.* 8 * 

The big surprise in the study was the impact of IQ, which was larger 
than that of socioeconomic Status. Brighter children from even the 
poorest households and with uneducated parents learned rapidly ahout 
politics, about how the government works, and about the possihilities 
for change. They were more likely to discuss, read ahout, and partici- 
pate in political activities than intelleetually slower children were. Not 
only was the gap in political development across cognitive classes larger 
than the gap across socioeconomic classes, ir tended to widen with age, 
while the gap due to socioeconomic dass did not—an important dis- 
tinction in trying to unclerstand the compararive roles ot intelligence 
and socioeconomic Status. IQ differences tend to he dynamic; socio¬ 
economic ditterences, static. The more important distinction from our 
perspective, however, is that cognitive ahility had more impact, and so- 
cioeconomic Status virtually none, on a child’s perception of the duties 
of citizenship. lf this he civility, then it is most purely a result of intel¬ 
ligence, ar least among the variables exainined. 

A study of older children—approximately 400 high school students— 
set out todetermine the importance of intelligence, contrasted with so¬ 
cioeconomic status, as a factor in political development. 9 The survey 
questions tapped a wide ränge of political behaviors and attitudes. From 
the responses, scales were constructed tor fourteen political dimensions. 
The youngsters were characterized by an Overall measure of socioeco- 
noinic background, plus separate measures of parental education, family 
wealth, media exposure, and a measure of verbal inteil igence made avail - 
able from school records. To a remarkable degree and with only a tew ex- 
ceptions, euch ot the political dimensions was most strongly correlatcd 
with intelligence.* 10 ’ This was true of scales that measured political 
knowledge, as would be expected.’ 111 But the hright youngsters were also 
much more aware of the potentialities of govemment and the duties of 
citizenship—civility again. A multivariate analysis of the results indi- 
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cated that intelligence per se, rather than socioeconomic Status, was 
driving rhe reiationships, and that when socioeconomic Status was 
significantly correlated with a dimension of political invoivement, it 
was via its effects on intelligence. It is possible that the importanee of 
intelligence was somewhat inflated in this study hecause the youngsters 
were disproportionately from working-dass Backgrounds, he nee under- 
estimating the impact of socioeconomic Status in more representative 
samples. However, the qualitative outcome leaves no doubt that intelli- 
gence, apart from the usual socioeconomic variables, has a potent effect 
on political behavior for teenagers, as well as for preteens. |u! 

VOTING BEHAVIOR AMONG ADULTS 

Social scientists do not find it easy to dragoon large samples oi adult 
Amerieans and make them sit still for the kinds of assessments of polir- 
ical invoivement that can he conducted with children. But they try 
nonetheless, and they have had some success, mostly centering on vot- 
ing. 

Depending on the election and the historical period, the turnout in 
elections for federal ofticeholders ranges from about 25 to 70 percent, 
with the recenr level in presidential elections in the 45 to 60 percent 
ränge. It may or may not he a pity that so many of our fdiow citizens 
fall to vote, but it is a boon to social scientists. With the deep split be- 
tween voters and nonvoters, voting has been an invaluahle resouree for 
gaining a glimpse into the nature of this manifesration of civility. ,n) 

Voting and Socioexonomic. dass 

The literature on voting repeats the familiär story: Most of the analysis 
has focused on socioeconomic dass, not cognitive ability. The Standard 
model of political participation, including voting, is that it is highly de- 
pendent on socioeconomic Status. 14 “College graduates vote more than 
high school graduates; white-coüar workers vote more than blue-collar 
workers; and the rieh vote more than the poor,” as Wolfinger summa- 
nzed it V The connection between political participation and social Sta¬ 
tus is so strong that almost any measure of it, no matter how casual, wäll 
pick up some part of the relationship. The impression we all have that 
elections are settled mostly by the votes of the middle and upper elasses 
hroadly construed is confirmed by careful scrutiny, if socioeconomic 
Status is the only measure taken of potential voters. 


When we are able to lock behind the isolated vote to broader kinds 
of political behavior, the same relationship prevails. The landmark study 
on this topic was conducted hy Sidney Verba and Norman Nie, who 
polled several thousand people representing the national population in 
1967 not only about their voting hut also about other political activi- 
ties—campaigning, demonstrating, contacting officials, and so on. 16 
Verba and Nie identified six categories of political activity, from “to- 
tally inactive” at one end to the “totally active” at the other, with four 
gradations in between. Almost without exception, however political 
participation was def'ined, socioeconomic Status was not only a signifi- 
cant predictor in a Statistical sense, but the differences across elasses 
were large. 1171 Among the total ly inactive (the lowest category), people 
were almost six times as likely to be from the bottom third in socio¬ 
economic Status as from the top third; among the rotally active (the 
highest category), more than four times as many were from the top third 
as from the bottom third. ln between the extremes of political partici¬ 
pation, rhe trends were unbroken and smooth: The higher the level of 
participation, the more likely the person was from a high-status Back¬ 
ground; the lower the level of participation, the more likely rhe person 
was from a low-status Background. 

Voting and Education 

What is it about socioeconomic Status that leads people to behave so 
differently? Verba and Nie did not present the breakdowns that permit 
an answer to that question. 1181 For that, we tum to another study, hy 
Raymond Wolfinger and Steven Rosenstone, that used the Current 
Population Surveys (CPS), conducted hy the Census Bureau, to an¬ 
swer questions about voting. 1191 The authors asked which of the three 
components of socioeconomic Status—education, income, and occupa- 
tional Status—-primarily influences voting. The clear answer was edu- 
cation. A College education raised a person s probahility of voting almost 
40 percentage points over what it would be if the person had less than 
five years of education, independent of income or occupational Status; 
postgraduate education raised it even more. Even for people in the top 
income category (more than $75,000 per year in 1990 dollars) a College 
education added 34 percentage points to a person s probahility of vot¬ 
ing. Occupational Status per se had an even smaller overall effect than 
income, and it was ambiguous toboot. For example, with education hold 
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constant, sales and clerical workers voted ar slightly higher rares than 
Professionals or managers. 

Educational attainment eorrelates not just with voting itself hut with 
political knowledge, interest, and attitudes—in short, with polirical so¬ 
phistication/ 0 Political sophistication, in turn, eorrelates with voting. 1211 
Educated people read more ahout political issues, and they keep their 
television sets and radios tuned to the news and public issues programs 
more than do people with less education. They think ahout polirical is¬ 
sues at more abstract levels than do less educated people, and less in 
terms of concrete, personal benefit. They are more likely to di sagree 
with Statements like, “So many people vote in the national election that 
it doesn’t matter much to me whether I vote or not.” Or, “It isn’t so im¬ 
portant to vote when you know your party doesn’t have a chance to 
win.”“ By disagreeing, educated people seem to be saying that they par- 
ticipate in an election even when the only payoff is a sense of having 
done rhe right thing, which we see as a mark of civility. 

Orher scholars who have examined rhis issue have come to the same 
conclusion that Wolfinger and Rosenstone demonsrratej most deci- 
sively: it is predominantly education, rather than incorne or occupa- 
tional Status, that links voting and socioeconomic Status/ 1 Soine 
scholars go so far as to conclude that, aside from the major effect of ed¬ 
ucation, voting and socioeconomic Status have litile to do with each 
other. IMI This turns the Standard theory on its head: Rather than ex- 
plaining the correlation hetween education and voting as an effect of 
socioeconomic Status, the evidence says that the correlation hetween 
socioeconomic Status and voting would more properly be attributed to 
education. 

Turning the explanation on its head may solve a puzzle that Verba 
and Nie noted/" Having shown that political leaders respond to pres¬ 
sure from their constituendes, they wondered why the upper socioeco¬ 
nomic classes participated more in political matters, when those ar the 
bottom were more dependenr on the governinent to solve their Prob¬ 
lems. If the people who have the most to gain or lose participated rhe 
most, then the lower classes would vote more than the middle or upper. 
Why don’t they? The answer is that participation is less a matter of di¬ 
rect benefit than of civility in the sense we are using the word here, and 
civility is higher among more educated people than among less educated 
ones. |Jft| 

Some of the more cynical dismissals of American political life are 
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similarly answered. Poor and humble workers, it is sometimes argued, 
are Jisenfranchised whether they vote or not, hecause rhe government 
Joes rhe bidding of the rieh and well placed. Ir is small wonder, then, 
that they do not vote, rhis argument continues. Bur the evidence shows 
ir is not so much the poor and humble who fail to vote; it is the uned- 
ucated. Ir may be easy to believe that the poor are disenfranchised, hut 
it is less obvious why it should be the uneducated (poor or not). What 
is the cynic to make of the facr that an underpaid hut well-educated 
shop c.lerk is more likely to vote than a less educated, rieh businessman? 

Vorm» and Cognitivc Ability 

he link hetween education and voting is clear. Doos it real ly signify a 
link hetween cognirive ability and voting? There is an indirect argu- 
ment that says yes, descrihed in the notes,^' 1 hut we have been able to 
find only rwo studies that tackle the question directly. 

I he first did not have an actual measure of IQ, only ratings of in- 
telligence by interviewers, based on their impressions after some train- 
ing. Phis is a legitimate procedure—rated intelligence is known to 
correlate w r ith tesred intelligence—hur the results must be treated as 
approximate. With that in mind, a multivariate analysis of a national 
sample in rhe American Narional Election study in 1976 showed that, 
of all the variables, by far the most significant in determining a person’s 
political sophistication were rated intelligence and ex presse d int er- 
est. Interest, however, was itself most tellingly affected by intelli¬ 
gente. 1 ^ 1 The more familiär independent variables—education, in- 
come, occuparional Status, exposure to the media, parental interest 
in politics—had small or no effects, after rated intelligence was taken 
into account. 

The one study of political involvement that included a test of intel¬ 
ligence was conducted in the San Francisco area in the 1970s. The 
intelligence test was a truncated one, based on a dozen vocahulary 
itemsQ About 150 people were interviewed in depth and assessed on 
political sophistication, which is known to correlate with political par¬ 
ticipation. 10 The usual background variables—incorne and education, 
ior example—were also obtained. Educational attainment was, as ex- 
pected, correlated with the test score. But even this rudimentary intel¬ 
ligence test score predicted political sophistication as well as education 
did. To Russell Neuman, the study s author, “the evidence supports the 
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idea of an independent cognitive-ability effect” as part of the proved 
link between socioeconomic Status and political participation.' 1 

We do not imagine that we have told the entire story of political par¬ 
ticipation. Age, sex, and ethnic identity are among the individual fac- 
tors that we have omitted but that political scientists routinely examine 
against the background of voting laws, regional variations, historical 
events, and the general political climate of the country. In various pe- 
riods and to varying degrees, these other factors have been shown to be 
associated with either the sheer level of political involvement or its 
character. Oldei people, fot example, are more likely to vote than 
younger people, up to the age at which the debilities of age intervene; 
women in the past participated less than men, but the gap has narrowed 
to the vanishing point (especially for educated men and women); dif¬ 
ferent ethnic groups resonate to different political causes.’’ 

Our locus on education and intelligence similarly gives insufficient 
attention to other personal rraits that influence political participation.' ’ 
People vary in their sense of civic duty and in the strength of their party 
affiliations, apart from their educarional or intellectual level; their per¬ 
sonal values color their political allegiances and how intensely they are 
feit. Their Personalities are expressed not just in personal life but also 
in their political actions (or inactions). 

The bottom line, then, is not that political participation is simple to 
describe but that, despite its cornplexity, so narrow a ränge of individ¬ 
ual factors carries so large a bürden of explanation. For example, the 
zero-order correlations between intelligence and the fourteen political 
dimensions in the study of high school studentsdescribed ahove ranged 
from .01 to .53, with an average of .22; the average correlation with the 
youngsters socioeconomic background was .09.^ For the sentiment of 
civic duty—the dosest approximation to civility in this particular set 
of dimensions—the correlation with intelligence was .4. As we cau- 
tioned above, this may be an overestimate, but perhaps not by much: 
The zero-order correlation between scores on a brief vocabulary test and 
the political sophistication of a sample of adults was .33.” The coeffi- 
cients for rated intelligence in a multivariate analysis of political so¬ 
phistication were more than twice as large as for any of the other 
variables examined, which included education, occupation, age, and 
parental interest in politics. 36 

The coherence of the evidence linking IQ and political participation 


as a whole cannot be neglected. The conti nuity of the re lat ionship over 
the life span gives it a plausibility that no single study can command. 
The other chapters in Part II have shown that cognitive ability often 
accounts for the importance of socioeconomic dass and underlies much 
of the Variation that is usually attributed to education. It appears that 
the same holds for political participation. 

MIDDLE-CLASS VALUES: DATA FROM THE NLSY 

The NLSY does not per mit us to extend this discussion directly. None 
of the questions in the study asks about political participation or knowL 
edge, But as we draw to the close of this long sequence of chapters about 
IQ and social behavior, we may use the NLSY to take another tack. 

For many years, “middle^class values” has heen a topic of debate in 
American public life. Many academic intellectuals hold middle-class 
values in contempt. They have a heiter reputation among the public, at 
large, however, where they are seen—rightly, in our view—as ways of 
behaving that produce social cohesion and Order. To use the language 
of this chapter, middle-class values are related to civility. 

Throughout Part II, we have been examining departures from mid- 
dle-class values: adolescents’ dropping out of school, babies born our of 
wedlock, men dropping out of the labor force or ending up in jail, women 
going on welfare. Let us now look at the glass as half full instead of half 
empty, concentrating on the people who are doing everything right by 
conventional Standards. And so, to conclude Part 11, we present the 
Middle (dass Values (MCV) Index. It has scores of “Yes” and “No.” A 
man in the NLSY got a “Yes” if by 1990 he had ohtained a high school 
degree (or more), heen in the labor force throughout 1989, never heen 
interviewed in jail, and was still married to his first wife. A woinan in 
the NLSY got a “Yes” if she had ohtained a high school degree, had never 
given hirth to a baby out of wedlock, had never heen interviewed in jail, 
and was still married to her first husband. People who failed any one of 
the conditions were scored “No.” Nevenmarried people who met all the 
other conditions except the marital one were exoluded from the analy- 
sis. We also excluded men who were not eligible for the laben* force in 
1989 or 1990 because they were physically unable to work or in school. 

Note that the index does not demand economic success. A man can 
earn a “Yes” despite being unemployed if he stays in the labor force. A 
woman can be on welfare and still earn a “Yes” if she höre her chiLiren 
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within marriage. Men and women alike can have incomes below the 
poverty line and still qualify. We do not require that the couple have 
children or that the wife forgo a career. The purpose of the MCV Index 
is to identify among the NLSY population, in their young adulthood 
when the index was scored, those people who are getting on with their 
lives in ways that fit the middle-class stereotype: They stuck with school, 
got married, the man is working or trying to work, the woman has con- 
fined her childbearing to marriage, and there is no criminal record (as 
far as we can teil). 

What does thishave to do with civility? We propose that even though 
many others in the sample who did not score “Yes” are also fine citizens, 
it is this population that forms the spine of the typical American Com¬ 
munity, filling the seats at the PTA meetings and the pews at church, 
organizing the Rotary Club fund-raiser, coaching the Litrle League 
team, or circulating a petition to put a stop light at a dangerous inter- 
section—and shoveling sidewalks and returning lost wallets. What 
might IQ have to do with qualifying for this group? As the table shows, 
ahout half of the sample earned “Yes” scores. They are markedly con- 


Cognitive Ability and the Middle Class Values Index 


Probability of scoring “Yes” on the MCV Index 

80%- 



-i-1-1-1-r 

Very low Very high 

<-2S0s) (+2 SDs) 

Note: for compuring the plor, age and either SPS (for rhe black eurve) or IQ (for ehe gray 
curvo) wen* .sei ar rht-ir mean values. 


Whites and the Middle-Class Values Index 

Cognitive Class 

Percentage Who 

Scored “Yes” as of 1990 

I Very bright 

74 

II Bright 

67 

III Normal 

50 

IV Du 11 

30 

V Very dull 

16 

Overall 

51 


centrated among the brighter people, with progressively smaller pro- 
portions on down through the cognitive classes, to an extremely small 
16 percent of the Class Vs qualifying. 

Furthermore, as in so many other analyses throughout Part II, cogni¬ 
tive ability, independent of socioeconomic background, has an impor¬ 
tant causal role to play. Below is the final Version of the graphic you 
have seen so often. 


As inruition might suggest, “uphringing” in the form of socioeco- 
nomic background makes a significant difference. But for the NLSY 
sample, it was not as significant as intelligence. Even when we conduct 
our usual analyses with the education suhsamples—thereby guarantee- 
ing that everyone meets one of the criteria (finishing high school)—a 
significant independent role for IQ remains. Its magnitude is diminished 
for the high school sample but not, curiously, for the College sample. 
The independent role of socioeconomic background hecomes insignif- 
icant in these analyses and, in the case ot the high-school-only sample, 
goes the “wrong” way after cognitive ability is taken into aecount. 

Much as we have enjoyed preparing the Middle Class Values Index, 
we do not intend it to become a new social Science benchmark. Its mod- 
est goals are to provide a vantage point on correlates of civility in a pop¬ 
ulation of young adults and then to serve as a reminder that the 
old-fashioned virtues represented through the index are Associated with 
intelligence. 
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THE DIFFERENCE BETWEEN BEING SMART AND BEING CIVIL 

Cognitive ability is a raw material for eivility, neu the thing itself. Snp- 
pose thar the task facing a Citizen is to vote on an initiative proposing 
some environmental policy involving (as environmental issues usually 
do) complex and subtle trade-offs between costs and benefits. Above- 
average intelligcnce means that a person is likely to be berter read and 
hetter able to think rhrough (in a purely technical sense) tbose trade- 
ofts. On the average, smarter people are more able to understand points 
ot vievv other than their own. But beyond these contributions of intel- 
ligence to cirizenship, high intelligence also seems to he associated with 
an interest in issues of civil concern. It is associated, perhaps surpris- 
ingly to some, with the behaviors that we identify with middle-class 
values. 

We should emphasize that vast quantities of this raw material called 
intelligence are not needed for many of the most fundamental forms 
of eivility and moral behavior. All of us might well pause at this point 
to think of the abundant exarnples of smart people who have been 
conspicuously uncivil. Yet these qualifications notwithstanding, the Sta¬ 
tistical tendencies remain. A smarter population is more likely to be, 
and more capable of being made into, a civil citizenry. For a nation pre- 
dicated on a high level of individual autonomy, this is a hier worth 
knowing. 


PART III 


The National Context 


Part II was circumscrihed, taking on social behaviors one at a time, fo- 
cusing on causal roles, with the analysis restricted to whites wherever 
the data permitted. We now turn to the national scene. This means coiv 
sidering all races and ethnic groups, which leads to the most contro- 
versial issues we will discuss: ethnic differences in cognitive ability and 
social behavior, the effects of fertility patterns on the distribution of in- 
telligence, and the ovcrall relationship of low cognitive ability to what 
has become known as the underclass. As we begin, perhaps a pact is ap- 
propriate. The facts about these topics are not only controversial but 
exceedingly complex. For our part, we will undertake to confront all the 
tough questions squarely. We ask that you read carefully. 




Chapter 13 


Ethnie Differences in 
Cognitive Ability 


Despite the forbiddmg air that envelops the topic, ethnic differences in cogni¬ 
tive ability are neither surjrrising nnr in douht. lairge human populations dif- 
fer in many ways, both cultural and biologicai It is not surfTrising thai they 
tni^ht di ff er at least slightly in their cognitive charactensncs. That they do is 
confirmed by the data on ethnic differences in cognitive ability from around 
the world. One message oj this chapter is that such differences are real and 
have consequerices. Another is that the facts are not as alarming as many peo- 
ple seem to fear. 

East Asians (e.g., Chinese, Japanese), ivhether in America or in Asia, 
typically earn higher scores on intelligence and achievement tests than white 
Americans . The prease size of their advantage is unclear; estimates ränge 
from just a few io len points. A märe certain difference between the races is 
that East Asians have higher nonverbal intelligence than whites while being 
equal, or perhaps slightly lower, in verbal intelligence. 

The difference in test scores between A/rican-Amerieam and European- 
Americans as measured in dozens of reputable Studies has converged on ap- 
Irroximately a one Standard deviation difference for several decades. Translated 
into centiles, this means that the average white person tests higher than about 
84 percent of the population of blacks and that the average black person tests 
higher than about 16 percent of the population ofwhites. 

The average black and white differ in IQ at every level of socioeconomic 
Status (SES), but they differ more at high levels of SES than at low levels. At- 
tempts to explain the difference in terms of test bias have failed. The tests have 
approximately equal predictive force for whites and blacks. 

ln the past few decades, the gap between blacks and whites narrowed by 
perhaps three IQ points. The narrowing appears to have been mainly caused 
hy a shrin/ang number of very low scores in the black population rather than 
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an incrcYLsrng number ofhigh scores. Improvements in the economic circurr i- 
.stance.s of blacks, in the quality of the schools they attend, in better public 
health , and perhaps also diminishing racism may be nanowing the gap. 

The debate about whether and how much genes and envinmment have to 
do with ethnic differences remains unresolved. The universality of the con- 
tra.se in nonverbal and verbal skills between East Asians and European whites 
suggests , without quite proving, genetic roots. Another line of evidence point - 
ing toward a genetic factor in cognitive ethnic differences is that blacks and 
whites differ most on the tests that are the best measures of g, or general in - 
telligence. On the other hand, the scores on even highly gdoaded tests can be 
influenced to sorne extern by changing environmental factors over the course 
of a decade or less. Beyond that, some social scientists have challenged the 
premise that intelligence tests have the same meaning for people ivho live in 
different cultural settings or ivho.se forebears had very different histories. 

Nothing seems more fearsome to many commentators than the possibility 
that ethnic and race differences have any genetic component at all. This be¬ 
lief is a fundamental error. Even if the differences between races were entirely 
genetic (which they surely are not), it should truike no practical difference in 
how individuals deal with each other. The real danger is that the elite ivi.sdom 
on ethnic differences—that such differences cannot exist —ivi/l shift to oppo- 
site and equally unjustified extremes. Open and informed discussion is the one 
certain way to protect society from the dangers of one extreme view or the 
other. 


E thnic differences in measured cognitive ability have heen found 
since in telligence tests were invented. The battle over the meaning 
of these differences is largely responsihle for today’s controversy over in- 
telligence testing itself. That many readers have tumed first to this chap- 
ter indicates how sensitive the issue has hecome. 

Our primary purpose is to lay out a set of Statements, as precise as the 
state of knowledge permits, ahout what is currendy known about the 
size, nature, validity, and persistence of ethnic differences on measures 
of cognitive ahility. A secondary purpose is to try to induce elarity in 
ways of thinking ahout ethnic differences, for discussions ahout such dif¬ 
ferences tend to run away with themselves, hlending issues of fact, the- 
ory, ethics, and public policy that need to be separated. 

The first thing to remember is that the differences among individu¬ 


als are far greater than the differences between groups. If all the ethnic 
differences in intelligence evaporated overnight, most of the intellec- 
tual Variation in America would endure. The remaining inequality 
would still strain the political process, because differences in cognitive 
ahility are prohlematic even in ethnically homogeneous societies. The 
chapters in Part 11, looking only at whites, should have made that clear. 
But the politics of cognitive inequality get hotter—sometimes too hot 
to handle—vvhen they are attached to the politics of ethnicity. We he- 
lieve that the best way to keep the temperature down is to work through 
the main facts carefully and methodically. This chapter first reviews the 
evidence hearing on ethnic differences in cognitive ahility, then turns 
to whether rhe differences originäre in genes or in environments. At 
the chapters end, we summarize what this knowledge ahout ethnic dif¬ 
ferences means in practical terms. 

We frequently use the word ethnic rather than race, because race is 
such a difficult concept to ernploy in the American context. 111 What 
does it me an to he “black” in America, in racial terms, when the word 
black (or African-American) can be used for people whose ancestry is 
more European than African? How are we to classify a person whose 
parents hail from Panama hur whose ancestry is predominantly African? 
Is he a Latino? A black? The rule we follow here is to classify people ac- 
cording to the way they classify themselves. The studies of “blacks” or 
“Latinos” or “Asians” who live in America generally denote people who 
say they are black, Latino, or Asian—no more, no less. 


Ethnic Nomenclature 


We want ro call people wharever they prefer to be called, ine lud ing their 
preferences for ethnic labels. As we write, however, there are no hard-and- 
fast rules. People from Latin America wish to be known according to their 
national origin: Cuban-American, Mexican-American, Puerto Rican, and 
so forth. Hispanic is still the U.S. government’s official label, but Ixxäno has 
gained favor in recent years. We use Latino. Opting for common usage and 
simplicity, we usually use black instead of African-American and white 
(which always refers to non-Latino whites) instead of European-Amer/can 
or Angln. Americans of Asian descent are called Asian when the context 
leaves no possibility ofeonfusion with Asians living in Asia. We shift to 
the hyphenated versions foreveryone when it would avoid such confusions 
or when, for styl ist ic reasons, the hyphenated versions seem appropriate. 
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It would be disingenuous to leave the racial issue at that, however, 
for race is offen on people’s minds when they think about IQ. Thus we 
will eventually comment on cognitive differences among races as they 
might derive from genetic differences, telling a story that is interesting 
but still riddled with more questions than answers. This prompts a sec- 
c>nd point to be understood at the outset: There are differences hetween 
races, and they are the ruie, not the exception. That assertion may seem 
controversial to some readers, hur it verges on tautology: Races are by 
definition groups of people who differ in characteristic ways. Intellec- 
tual fashion has dictated that all differences must be denied except the 
absolutely undeniable differences in appearance, but nothing in biology 
says this should be so. On the contrary, race differences are varied and 
comp lex—and they make the human species more adaptahle and more 
interesting. 

THE TESTED INTELLIGENCE OF ASIANS, BLACKS, AND 
WHITES 

So mach for preliminaries. Answers to commonly asked questions about 
the ethnic groups in America follow, beginning with the basics and 
inoving into successively more complicated issues. The black-white dif- 
ference receives by far the most detailed examination because it is the 
most controversial and has the widest social ramifications. Bur the most 
common question we have been asked in recent years has not been about 
blacks but about Asians, as Americans have watched the spectacular 
economic success of the Pacific rim nations at a distance and, closer to 
home, become accustomed to seeing Asian Immigrant children cob 
lecting top academic honors in America’s schools. 

Do Asians Have Higher IQs Than Whites? 

Probably yes, if Asian refers to the Japanese and Chinese (and perhaps 
also Koreans), whorn we will refer to here as East Asians. How much 
higher is still unclear. Richard Lynn, a leading scholar of racial and eth- 
nie differences, has reviewed the assembled data on overall Asian IQ in 
two major articles. In his 1991 review of the literature, he put the me¬ 
dian IQ for the studies of Chinese living in Hong Kong, Singapore, Tai¬ 
wan, and China proper at 110; the median IQ for the studies of Japanese 
living in Japan at 103; and the median for studies of East Asians living 
in North America at 103c But as Lynn acknowledges, these compar- 


isons are imprecise because the IQs were not corrected for the changes 
that have been observed over time in national IQ averages. ln Lynn’s 
1987 Compilation, where such corrections were made, the medians for 
both Chinese and Japanese were 103. 5 Mean white American IQ is typ- 
ically estimated as 101 to 102. 4 Additional studies of Chinese in Hong 
Kong, conducted by J. W. C. Chan using the Ravens Standard Progres¬ 
sive Matrices, a nonverbal test that is an especially good measure of g, 
found IQ equivalents in the region of 110 for both elementary and sec¬ 
ondary students, compared to about 100 for whites in Hong Kong. 5 An- 
other study postdating Lynn’s review compared representative samples 
of South Korean and British 9-year-olds and found an IQ difference of 
nine points/ 1 

The most extensive Compilation of East Asian cognitive performance 
in North America, by Philip Vemon, included no attempt to strike an 
overall estimate for the current gap between the races, but he did draw 
conclusions about East Asian-white differences in verbal and nonver¬ 
bal abilities, which we will describe later in the chapter. 7 In addition to 
studies of abilities, Vernon compiled extensive data on the schoolwork 
of East Asians, documenting their superior performance by a variety of 
measures ranging from grades to the acquisition of the Ph.D. Is this su¬ 
perior performance caused by superior IQ? James Flynn has argued that 
the real explanation for the success of Asian-Americans is that they are 
overachievers. rt He also says that Asian-Americans actually have the 
same nonverbal inrelligence as whites and a fractionally lower verbal 
intelligence. 0 Richard Lynn disagrees and concludes from the same data 
used by Flynn that there is an ethnic difference in overall IQ as well. 10 

The NLSY is not much help on this issue. The sample contained only 
forty-rwo East Asians (Chinese, Japanese, and Koreans). Their mean 
IQ was 106, compared to the European-American white mean of 103, 
consistent with the evidence that East Asians have a higher IQ than 
whites but based on such a small sample that not much can be made 
of it. 

The indeterminancy of the debate is predictable. The smaller the IQ 
difference, the more questionable its reality, and this has proved to be 
the case with the East Asian-white difference. It is difficult enough to 
find two sets of subjects within a single city who can be compared with- 
out problems of Interpretation. Can one compare test scores obtained 
in different years with different tests for students of different ages in dif¬ 
ferent cultural settings, drawn from possibly different socioeconomic 
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populations? One answer is that it can be done through techniques that 
take advantage of patterns observed over many studies. Lynn in partic- 
ular has responded to each new critique, in some cases providing new 
data, in others refining earlier estimates, and always pointing to the 
striking similarity of the results despite the disparity of the tests and set- 
tings. 11 But given the complexities of crossnational comparisons, the is- 
sue must eventually be settled by a sufficient body of data obtained from 
identical tests administered to populations that are comparable except 
for race. 

We have been able to identify three such efforts. In one, samples of 
American, British, and Japanese students ages 13 to 15 were adminis¬ 
tered a test of abstract reasoning and spatial relations. The American 
and British samples had scores within a point of the standardized mean 
of 100 on both the abstract and spatial relations components of the test; 
the Japanese adolescents scored 104.5 on the test for abstract reasoning 
and 114 on the test for spatial relations—a large difference, amounting 
to a gap similar to the one found by Vernon for Asians in America. 12 

In a second set of studies, 9-year-olds in Japan, Hong Kong, and 
Britain, drawn from comparable socioeconomic populations, were ad¬ 
ministered the Ravens Standard Progressive Matrices. The children 
from Hong Kong averaged 113; from Japan, 110; and from Britain, 
100—a gap of well over half a Standard deviation between both the 
Japanese and Hong Kong samples and a British one equated for age and 
socioeconomic Status. 13 

The third set of studies, directed by Harold Stevenson, administered 
a battery of mental tests to elementary school children in Japan, Tai¬ 
wan, and Minneapolis, Minnesota. The key difference between this 
study and the other two was that Stevenson and his colleagues carefully 
matched the children on socioeconomic and demographic variables. 14 
No significant difference in overall IQ was found, and Stevenson and 
colleagues concluded that “this study offers no support for the argument 
that there are differences in the general cognitive functioning of Chi¬ 
nese, Japanese, and American children.” [15] 

Where does this leave us? The parties in the debate are offen indi- 
vidually confident, and you will find in their articles many flat State¬ 
ments that an overall East Asian-white IQ difference does, or does not, 
exist. We will continue to hedge. Harold Stevenson and his colleagues 
have convinced us that matching subjects by socioeconomic Status can 
reduce the difference to near zero, but he has not convinced us that 


Jews, Latinos, and Gender 


In the text we focus on three major racial-ethnic groupings—whites, East 
Asians, and blacks—because they have dominated both the research and 
contentions regarding Intelligence. But whenever the subject of group dif¬ 
ferences in IQ comes up, three other questions are sure to be asked: Are 
Jews really smarter than everyone eise? Where do Latinos fit in, compared 
to whites and blacks? What about women versus men? 

jews—specifically, Ashkenazi Jews of European origins—test higher 
than any other ethnic group. 16 A fair estimate seems to be that Jews in 
America and Britain have an overall IQ mean somewhere between a half 
and a full Standard deviation above the mean, with the source of the dif¬ 
ference concentrated in the verbal component. In the NLSY, ninety-eight 
whites with IQ scores identified themselves as Jews. The NLSY did not try 
to ensure representativeness within ethnic groups other than blacks and 
Latinos, so we cannot be sure that the ninety-eight Jews in the sample are 
nationally representative. But it is at least worth noting that their mean 
IQ was .97 Standard deviation above the mean of the rest of the popula- 
tion and .84 Standard deviation above the mean of whites who identified 
themselves as Christian. These tests results are matched by analyses of oc- 
cupational and scientific attainment by Jews, which consistently show 
their disproportionate level of success, usually by Orders of magnitude, in 
various inventories of scientific and artistic achievement, 17 

The term Latino embraces people with highly disparate cultural her- 
itages and a wide ränge of racial Stocks. Many of these groups are known 
to differ markedly in their social and economic profiles. Add to that the 
problem of possible language difficulties with the tests, and generalizations 
about IQ become especially imprecise for Latinos. With that in mind, it 
may be said that their test results generally fall about half to one Standard 
deviation below the national mean. In the NLSY, the disparity with whites 
was .93 Standard deviation. This may be compared to an overall average 
difference of .84 Standard deviation between whites and Mexican-Ameri- 
cans found in the 1960s on the tests used in the famous Coleman report 
(described in Chapter 17). 18 We will have more to say about the Interpre¬ 
tation of Latino scores with regard to possible language bias in Chapter 14. 

When it comes to gender, the consistent story has been that men and 
women have nearly identical mean IQs but that men have a broader distri- 
bution. In the NLSY, for example, women had a mean on the Armed Forces 
Qualification Test (AFQT) that was .06 Standard deviation lower than the 
male mean and a Standard deviation that was .11 narrower. For the Wech¬ 
sler Intelligence Scale for Children, the average boy tests 1.8 IQ points 
higher than the average girl, and boys have a Standard deviation that is .8 
point larger than girls. 19 The larger Variation among men means that there 
are more men than women at either extreme of the IQ distribution. 





276 The National CJontext 


Ethnie Differences in Cogriitive Ability 277 


matching by socioeconomic Status is a good idea if one wanrs to know 
an estimate of the overall difference herween East Asians and whites 
(wc will return to the question of matehing by soeioeconomic Status 
when we discuss comparisons herween blaeks and whites). In our judg- 
ment, the balance of the evidence supports the proposition that rhe 
overall East Asian mean is higher rhan the white mean. If we had to put 
a numher on it, three IQ points currently most resemhles a consensus, 
tentative though it still is. East Asians have a greater advantage than 
that in a partieular kind of nonverbal intelligence, deseribed later in the 
chapter. 

Do Blacks Score Differently from Whites on Standanlized Tests of 
Cognitive Ability? 

If the samples are chosen to be representative of the American popula- 
tion, the answer has been yes for every known test of cognitive ability 
that meets basic psychometric Standards of reliabilitv and validityT The 
answer is also yes for almost all of the studies in which the black and 
white samples are matched on some special characteristics-—samples of 
juvenile delinquenrs, for example, or of graduate students—hut there 
are exceptions. The implication of this effect of selecting the groups to 
he compared is discussed later in the chapter. Since black-white Jitter- 
ences are the ones that strain discourse most severely, we will probe 
deeply into the evidence and its meaning. 

How Large Is the Black-White Difference? 

The usual answer to this question is one Standard deviation. 1 ln dis- 
cussing IQ tests, for example, the black mean is commonly given as 85, 
rhe white mean as 100, and the Standard deviation as 15. Rur rhe dif- 
terences observed in any given study seldom conform exactly to one 
Standard deviation. The figure below shows the distribution of the 
black-white difference (suhsequently ahhreviared as rhe U B/W Jitter- 
ence”) expressed in Standard deviations, in the American studies con- 
ducted in this Century that have reported rhe IQ means of a black sample 
and a white sample and meet basic requirements of interpretability as 
deseribed in rhe note. 1 “ 1 A total of 156 studies are represented in the 
plot, and the mean B/W difference is 1.08 Standard deviations, or about 
sixteen IQ points.The spread of results Ls substantial, however, re- 
flecting the diversity of the age of the subjects, their geographic loca- 


OverView of studies of reporting biack-white differences in 
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Magnitude of the black-white difference in test scores, 
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Sonnys; Sluicv 1066; Osbomc and McGurk 1982; Sattler J988; Vincent 1001; Jenson 1085, 
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tion, their hackground characteristics, the tests themselves, and sam¬ 
pling error. 

When we focus on the studies that meet stricter criteria, the ränge of 
values for the R/W difference narrows accordingly. The ränge of resulrs 
is eonsiderahly reduced, for example, for studies that have taken place 
since 1940 (after testing’s most formative period), outside the South 
(where the largest R/W differences are found), with subjects older than 
age 6 (after scores have become more stähle), using full test batteries 
from one of rhe major IQ tests, and with Standard deviations reported 
for rhat specific test administration. Of the torry-tive studies meeting 
these criteria, all hut nine of the B/W differences are clustered between 
.5 and 1.5 Standard deviations. The mean difference was 1.06 Standard 
deviations, and all hut eight of the thirty-one reported a B/W difference 
greater than .8 Standard deviation. 

Still more rigorous selection criteria do not diminish the size of the 
gap. For example, with tests given outside the South only after 1960, 
when people were increasingly sensitized to racial issues, the numher of 
studies is reduced to twenty-four, but the mean difference is 1.10 Stan¬ 
dard deviations. The NLSY, administered in 1980 to by far the largest 
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sample (6,502 whites, 3,022 blacks) in a national study, found a ditfer- 
ence ot 1.21 Standard deviations on the AFQT.^ 


Computing the B/W Difference 

The simplest vvay to compute the B/W difference vvhen limited informa- 
■rion is available is to take the two means and to compare them using the 
Standard deviation for the reference population, detined in this case as 
whites. This is how the differences in the tigure on page 277 showing the 
results of 1 56 studies vvere computed. When all the data are available, how- 
ever, as in the case of the NLSY, a more accurate method is available, which 
takes into account the Standard deviations within each population and the 
relative size of the samples. The equation is given in the noteL S| LJnless 
otherwise specified, all of the subsequent expressions of the B/W differ¬ 
ences are based on this method. (For more about the scoring of IQs in the 
NLSY, see Appendix 2.) ! 

I_ 

Answering the question “How large is the difference?“ in terms of 
Standard deviations does not convey an intuitive sense of the size of the 
gap. A rouglvand-ready way of thinking about the size of the gap is to 
recall that one Standard deviation above and helow the mean cuts off 
the 84th and 16th percentiles of a normal distribution. ln the case of 
the B/W difference of 1.2 Standard deviations found in the NLSY, a 
person with the black mean was at the 1 Ith percentile of the white dis- 
tribiition, and a person with the white mean was at the 9Ist percentile 
of the black distribution. 

A difference of this magnitude should be thought of in several differ¬ 
ent ways, each with its own important implications. Recall first that the 
American black population numbers more than 30 million people. lf the 
results trom the NLSY apply to the total black population as ot the 1990s, 
around 100,000 blacks fall into Class 1 of our five cognitive classes, wirb 
IQs of 125 or higher." 6 One hundred thousand people is a lot of people. 
It should he no surprise to see (as one does every day) blacks function- 
ing at high levels in every intellectually challenging field. 

It is important to understand as well that a difference of 1.2 Standard 
deviations means considerable overlap in the cognitive ability distrib- 
ution for blacks and whites, as shown for the NLSY population in the 
figure helow. For any equal number of blacks and whites, a large pro- 
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portion have IQs that can bc matched up. This is the distribution to 
keep in mind whenever thinking ahout individuals. 

Rut an additional complication has to be taken into account: ln the 
United States, there are about six whites for every black. This means 
that the IQ overlap of the two populations as they actually exist in the 
United States looks very different from the overlap in the figure just 
above. The next figure presents the same data from the NLSY when the 
distributions are shown in proportion to the aetual population of young 

The black and white IQ distributions in the NLSY, Version II 


Frequency distributions proportional to the 
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people represented in the NLSY. This figure shows why a B/W differ- 
ence can he prohlemaric ro American society as a whole. At rhe lower 
end of the IQ ränge, there are approximately equal numbers of blacks 
and whites. Rut throughout the upper half of the ränge, the dispropor- 
tions hetween the number of whites and blacks at any given IQ level 
are huge. To the extent that the difference represents an authenric dit- 
ference in cognitive functioning, the social consequences are poteiv 
tially huge as well. But is the difference authentic. 7 

Are the Differences in Black and White Scores Attributable to Cultural 
Bios or Other Artifacts of the Test? 

Appendix 5 contains a discussion of the state of knowledge regarding 
test bias. Here, we shall quickly review the Basic findings regarding 
blacks, without repeating the citations in Appendix 5, whicb we urge 
you to read. 

EXTERNAL EviDENt;E of Rias. Tests are used to predict things—most 
common ly, to predict performance in school or on the job. Chapter 3 
discussed this issue in detail. You will recall that the ahility of a test to 
predict is known as its validity. A test with high validity predicts ac- 
cu rate ly; a test with poor validity makes many mistakes. Now suppose 
that a test’s validity differs for the members of two groups. To use a con- 
erete example: The SAT is used as a tool in College admissions because 
it has a certain validity in predieting College performance. If the SAT is 
biased against blacks, it will underpredict their College performance. If 
rests were biased in this way, blacks as a group would da better in coh 
lege than the admissions office expect.ed based just on their SATs. It 
would be as if the test underestimated the “true” SAT score of the blacks, 
so the natural remedy for this kind of bias would be to compensate the 
black applicants by, for example, adding the appropriate number of 
points onto their scores. 

Predictive bias can work in another way, as when the test is simply 
!ess reliable—that is, less accurate—for blacks than for whites. Suppose 
a test used to select police sergeants is more accurate in predieting rhe 
performance of white candidates who become sergeants than in pre¬ 
dict ing the performance of black sergeants. It doesn’t underpredict for 
blacks, but rather fails to predict at all (or predicts less accurately). In 
these cases, the natural remedy would he to give less weight to rhe test 
scores of blacks than to those of whites. 


The key concept for both types of bias is the same: A test biased against 
blacks does not predict black performance in the real world in the same way 
t hat it predicts white performance in the real world. The evidence of bias is 
externa! in the sense that it shows up in differing validities for blacks and 
whites. External evidence of Rias has been sought in hundreds of stud- 
ies. It has been evaluated relative to performance in elementary school, 
in secondary school, in the university, in the armed forces, in unskilled 
and skilled jobs, in the professions. Overwhelmingly, the evidence is 
that the major standardized tests used to help make school and joh de- 
cisions 12 ' 1 do not underpredict hlack performance, nor does the expert 
Community find any other general or systematic difference in the pre¬ 
dictive accuracy of tests for blacks and whites. 1 "' 1 

INTERNAL EvinHNCT OF Rias. Predictive validity is the ulrimate crite- 
rion for bias, because it involves the proof of the pudding for any test. 
Rut although predictive validity is in a rechnical sense the decisive is¬ 
sue, our impression from talking about this issue with colleagues and 
friends is that other types of potential bias loom (arger in their imagi- 
nations: the many things that are put linder the umhrella label of cul¬ 
tural bias.” 

The most common eharges of cultural bias involve rhe putative cul¬ 
tural loading of items in a test. Here is an SAT analogy item rhar has 
become lamous as an example of cultural bias: 

RUNNER:M ARATHON 

(A) envoyiembassy 

(B) martyr:massacre 

(C) oarsmamregatta 

(D) referee: tournamen r 

(E) horse:stable 

The answer is “oarsmamregatta”—fairly easy if you know what both a 
marathon and a regatta are, a matter ofguesswork otherwise. How would 
a black youngster from rhe inner city ever have heard of a regatta 7 Many 
view such items as prüOf that the tests must be biased against people 
from disadvantaged backgrounds. “Clearly,” writes a critic of resting, 
citing this example, “this item does not measure students aptitude oi 
logical reasoning ahility, but knowledge of upper-middle-class recrca- 
tional activity.” |2 ' >! ln the language of psychometrics, this iscalled interna! 
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evidence of bias, as contrasted with the extcmal evidence of differen¬ 
tial prediction. 

The hypothesis of bias again lends icself to direct examination. In ef- 
fect, the SAT critic is saying that culturally loaded items are producing 
at least some of the B/W difference. Ger rid of such items, and the gap 
will narrow. Is he correct? When we look at the results for items that 
have answers such as “oarsmamregatta” and the results for items that 
seem to be empty of any cultural information (repeating a sequence of 
numbers, fc^r example), are there any differences ? ; 101 Are differences in 
group test scores concentrated among certain items? 

The technical literature is again clear. In study after study of the lead- 
ing tests, the hypothesis that the B/W difference is caused hy questions 
with cultural content has been contradicted by the factsd 1 Items that 
the average white test taker finds easy relative to other items, the aver- 
age black test taker does too; the same is true for items that the average 
white and black find diffieult. Inasmuch as whites and blacks have dif¬ 
ferent Overall scores on the average, it follows that a smaller proportion 
of blacks get right answers for either easy or hard items, hut the Order of 
difficulty is virtually the same in each racial group. For groups that have 
special language considerations—Latinosand American Indians, for ex¬ 
ample—some internal evidence of bias has been found, unless English 
is their native language. 12 

Studies comparing blacks and whites on various kinds of IQ tests find 
that the B/W difference is not created by items that ask about regattas 
or who wrote Hamlet, or any of the other similar examples cited in crit- 
ieisms of tests. How can this be? The explanation is complicated and 
gties deep into the reasons why a test item is “good” or “bad” in mea- 
suring intelligence. Here, we restrict ourselves to the conclusion: The 
B/W difference is wider on items that appear to be culturally neutral than on 
items that appear to be culturally loaded. We italicize this point because it 
is borh so well established empirically yet comes as such a surprise to 
most people who are new to this topic. We will elaborate on this find- 
ing later in the chapter. In any case, there is no longer an important 
technical dehate over the conclusion that the cultural content of test 
items is not the cause of group differences in scores. 

Motivation TO Try.” Suppose that the natu re of cultural hias does not 
lie in predictive validity or in the content of the items but in what might 
be cailed “test willingness.” A typical black youngster, it is hypothesized, 


comes to such tests with a mindset different from the white subjects. He 
is less attuned to testing Situation« (from one point of view), or less in- 
clined to put up with such nonsense (from another), Perhaps he just 
doesn’t give a damn, since he has no hopes of going to College or other' 
wise benefiting from a good test score. Perhaps he figures that the test is 
biased against him anyway, so what s the point. Perhaps he consciously 
refuses to put out his best effort because of the peer pressures against “act- 
ing white” in some inner-ciry schools. 

The studies that have attempted to measure motivation in such sit- 
uations have generally found that blacks are at least as motivated as 
whites.* 1 But these are not wholly convincing, for why shouldn’t the 
measures of motivation be just as inaccurate as the measures of eognt- 
tive ability are alieged to be? Analysis of internal characteristics of the 
tests once again öfters the best leverage in examining this broad by- 
pothesis. Two sets of data seem especially pertinent. 

The first involves the digit span subtest, part of the widely used Wech¬ 
sler intelligence tests. It has two forms: forward digit span, in which the 
subject tries to repeat a sequence of numbers in the order read to him, 
and backward digit span , in which the subject tries to repeat the sequence 
of numbers backward. The test is simple in concept, uses numbers that 
are familiär to everyone, and calls on no cultural information besides 
knowing numbers. The digit span is especially informative regarding test 
motivation not just because of the low cultural loading of the items but 
because the backward form is twice asg-loaded as the forward form, it is 
a much better measure of general intelligence. The reason Ls that revers- 
ing the numbers is mentally more detnanding than repeating them in 
the heard order, as readers can determine for themselves by a litrle seif¬ 
test ing. 

The two parts of the subtest have identical content. They occur 
at the same time during the test. Each subject does both. Rur in most 
studies the hlack-white difference is about twice as great on backward 
digits as on forward digits. |H1 The question arises: How can lack of 
motivation (or test willingness or any other explanation of that type) 
explain the difference in performance on the two parts of the same sub¬ 
test? 35 

A similar question arises from work on reacrion time. Several psy- 
chometricians, led by Arthur Jensen, have been exploring the underly- 
ing nature of g by hypothesizing that neurologic processing speed is 
implicated, akin to the speed of the microprocessor in a Computer. 
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Smarter people process faster than less smart people. The strategy for 
testing the hypothesis Ls to give people extremely simple cognitive 
tasks—so simple that no conscious thought is involved—and to use pro 
cise timing methods to determine how fast different people perform 
these simple tasks. One commonly used apparatus involves a console 
with a semicircleofeight lights, each witha hutton next to it. In rhe mid¬ 
dle of the console is the “home” button. At the beginning of each trial, 
the subject is depressing the home button with his fingen One of the 
lights in the semicircle goes on. The subject moves his finger to the but- 
ton closest to the light, which turns it off. There are more complicated 
versions of the task (three lights go on, and the subject moves to the one 
that is farthest from the other two, for example), but none requires rnuch 
thought, and everybody gets every trial “right.” The subject’s response 
speed is broken into two measurements: reaction time (RT), the time it 
takes the subject to lift his finger from the home button after a target light 
goes on, and movement time (MT), the time it takes to move the finger 
from just above the home button to the target button. 1 ™ 1 

Francis Galton in the nineteenth Century believed that reaction time 
is associated with intelligence but could not prove it. He was on the 
right track after all. In modern studies, reaction time is correlated with 
the results from full-scale IQ tests; even more specifically, it is correlated 
with the g factor in IQ tests—in some studies, only with the g factor. 17 
Movement time is much less correlated with IQ or with gT This makes 
sense: Most of the cognitive Processing has been completed by the time 
the finger leaves the home hutton; the rest is mostly a function of small 
motor skills. 

Research on reaction time is doing much to advance our under- 
standing of the biological hasis of g. For our purposes here, however, it 
also öfters a test of the motivation hypothesis: The consistent resulr of 
many studies is that white reaction time is faster than black reaction 
time, but black movement time is faster than white movement tirneT 
One can imagine an unmotivated subject who thinks the reaction time 
test is a waste of time and does not try very hard. But the level of moti¬ 
vation, whatever it may be, seems likely to be the same for the measures 
of RT and MT. The question arises: How can one be unmotivated to do 
well during one split-second of a test but apparently moüvated during 
the next split-second? Results of this sort argue against easy explana- 
tions that appeal to differences in motivation as explanatory of the R/W 
difference. 


Unhorm Background Bias. Other kinds of bias discussed in 
Appendix 5 include the possibility that hlacks have less access to 
coaching than whites, less experience with tests (less “testwiseness”), 
poorer understanding of Standard English, and that their performance 
is affected by white examiners. Each of these hypotheses has heen 
investigated, for many tests, under many conditions. None has been 
sustained. In short, the testable hypotheses have led toward the 
conelusion that cognitive ability tests are not biased against hlacks. 
This leaves one final hypothesis regarding cultural bias that does not 
lend itself to empirical evaluation, at least not directly. 

Suppose our society is so steeped in the conditions that produce test 
bias that people in d isadvantaged groups underscore their cognitive abil- 
ities on all the items on tests, therehy hiding the internal evidence of 
bias. Ar the same time and for the same reasons, they underperform in 
school and on the job in relation to their true ahiliries, therehy hiding 
the external evidence. In other words, the tesrs may he biased against 
disadvantaged groups, but the traces of bias are invisihle hecause the 
bias permeates all areas of the group’s performance. Accordingly, it 
would he as useless to look for evidence of test bias as it would be for 
Einsteins imaginary person traveling near the speed of light to try rode- 
termine whether time has slowed. Einsteins rraveler has no clock that 
exists independent of his space-time contexr. In assessing test bias, we 
would have no test or criterion measure that exists independent of this 
cuhure and its history. This form of bias would pervade everything. 

To some readers, the hypothesis will seena so plausible that it is self- 
evidently correct. Before deciding that this must be the explanation for 
group differences in test scores, however, a few prob [eins must be over- 
come. First, the comments about the digit span and reaction time re- 
sults apply here as well. How can this uniform background hias suppress 
black reaction time but not the movement time? How can it suppress 
performance on backward digit span more than forward digit span? Sec- 
ond, the hypothesis implies that: many of the performance yardstieks in 
the society at large are not only biased, they are all so similar in the de- 
gree to which they distort the truth—in every occupation, every type 
of educational Institution, every achievement measure, every perfor¬ 
mance measure—that no differential distortion is picked up by the data. 
Is this plausible? 

It is not good enough to accept without question that a general “back¬ 
ground radiation” of bias, uniform and uhiquitous, explains away black 
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and white differences in test scores and performance measures. The hy- 
pothesis might, in theory, he true. But given the degree to which every- 
day experience suggests that the environment confronting hlacks in 
different sectors of American life is not uniformly hostile and given the 
consistency in results from a wide variety of cognitive measures, assum- 
ing that the hypothesis is true represents a considerahly longer leap of 
faith than the much more limited assumption that race prejudice is still 
a factor in American life. In the matter of test bias, this hrings us to the 
frontier of knowledge. 

Are the Differences in Overall Black and White Test Scores Attributüble 
to Differences in Socioeconomic Status! 

This question has two different answers depending on how the question 
is understood, and confusion is rampant. We will take up the two an- 
swers and their associated rationales separately: 

First Version: If you exvract the effects of socioeconomic dass, what haff- 
ffens to the overall magnitude. of the B/W difference! Blacks are dispropor- 
tionately in the lower socioeconomic classes, and socioeconomic dass 
is known to he associated with IQ. Therefore, many people suggest, part 
of what appears to he an ethnic difference in IQ scores is actually a so¬ 
cioeconomic difference. 

The answer to this version of the question is that the size of the gap 
shrinks when socioeconomic Status is Statistical ly extracted. The NLSY 
gives a result typical of such analyses. The B/W difference in the NLSY 
is 1.21. In a regression equation in which both race and socioeconomic 
hackground are entered, the difference between whites and blacks 
shrinks to .76 Standard deviation. 1401 Socioeconomic Status explains ^7 
percent of the original B/W difference. This relationship is in line with 
the results from many other studies. 1411 

The difficulty comes in interpreting what it means to “control” for 
socioeconomic Status. Mate hing the Status of the groups is usually jus- 
tified on the grounds that the scores people earn are caused to soine ex- 
tent by their socioeconomic Status, so if we want to see the “real” or 
“authentic” difference between them, the contrihution of Status must 
be excluded. 1421 The trouble is that socioeconomic Status is also a result 
of cognitive ability, as people of high and low cognitive ability move to 
correspondingly high and low places in the socioeconomic continuum. 
The reason that parents have high or low socioeconomic Status is in part 


a function of their intelligence, and their intelligence also affects the 
IQ of the children via both genes and environment. 

Recause of these relationships, "controlling” for socioeconomic Sta¬ 
tus in racial comparisons is guaranreed to reduce IQ differences in the 
samc way that choosing black and white samples from a school for the 
intellectually gifted is guaranreed to reduce IQ differences (assuming 
race-hlind admissions Standards). But the remaining difference is not 
necessarily more real or authentic than the one we Start with. This seems 
to be a hard point to grasp, judging from the pervasiveness of Control¬ 
ling ior socioeconomic Status in the sociological literature on ethnic dif¬ 
ferences. But suppose we were asking whether blacks and whites differed 
in sprinting speed, and controlled for “varsity Status” by examining only 
athletes on the track teams in Division 1 Colleges. Blacks would proba- 
hly still Sprint faster than whites on the average, but it would he a smaller 
difference than in the population at large. Is there any sense in which 
this smaller difference would be a more accurate measure of the racial 
difference in sprinting ability than the larger difference in the general 
population? We pose that as an interesting theoretical issue. ln terms of 
numhers, a reasonable rule of rhumb is that Controlling for socioeco¬ 
nomic Status reduces the overall B/W difference by about a third. 

Second version: As blacks move uff the socioeconomic ladder, do the dif¬ 
ferences with whites of similar socioeconomic Status diminish! The first Ver¬ 
sion of the SES/1Q question referred to the overall score of a population 
of hlacks and whites. The second version concentrates on the B/W dif¬ 
ference within socioeconomic classes. The rationale goes like this; 
Blacks score lower on average hecause they are socioeconomically at a 
disadvantage in our sociery. This disadvantage should most seriously 
handicap the children of hlacks in the lower socioeconomic classes, who 
suffer from greater barriers to education and occupational advancement 
than do the children of hlacks in the middle and upper classes. As blacks 
advance up the socioeconomic ladder, their children, less exposed to 
these environmental deficits, will do hetter and, hy extension, close the 
gap with white children of their dass. 

This expectation is not borne out by the data. A good way to illus¬ 
trate this is by using our parental SES index and matching it againsr the 
mean IQ score, as shown in the figure below. IQ scores increase with 
economic statu.s for both races. But as the figure shows, the magnitude 
of the B/W difference in Standard deviations does not decrease. Indeed, 
it gets larger as people move up from the very bottom of the socioeco- 



288 The National Gontext 


Ethnie Differences in Cognitive Ahility 289 


Black IQ scores go up with socioeconomic Status, but the black- 
white difference does not shrink 


Black/white IQ difference, 
in Standard deviations 
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Black mean [Q, 
in IQ points 

- 120 



(right-hand scale) 

0.0 -1-1-1-1-1 -i-1-1-1- 

Ist 2d 3d 4th 5th 6th 7th 8th 9t h lOth 

Parental SES, by decile 


-110 

- 100 


- 90 


-80 


t 70 


nomic ladder. The pattern shown in the figure is consistent with many 
other major studies, except that the gap flattern out. In orher studies, 
the gap has continued to increase throughout the ränge of socioeco- 

I4M 

norme Status. 1 

How Do African-Americans Compare with Blacks in Africa on 
Cognitive Tests! 

This question often arises in the context of black-white comparisons in 
America, the thought being that the African black population has not 
been subjected to the historical iegacy of American black slavery and 
discrimination and might therefore have higher scores. Many studies of 
African students in primary and secondary schools, in both urban and 
rural areas, have included cognitive ability tests. As in the United 
States, it has been demonstrated in Africa that the same test items that 
discriminate best among blacksdiscriminate best among whites and that 
the same factors that depress white scores (tor example, coming trom a 
rural area) depress black scores. The predictive validity of tests for aca- 
demic and job performance seems to be about the same. ln general, the 


psychometric properties of the standardized tests are the same for blacks 
living in Africa as for American blacks. 44 

1t has been more difficult to assemble data on the score of the aver¬ 
age African black than one would expect, given the extensiveness of 
the test experience in Africa. In the same review of the literature that 
permitted the above generalizations, for example—a thirty-page article 
foilowed by a bibliography of more than 200 titles—not a single aver¬ 
age is reportedT One reason for this reluctance to discuss averages is 
that blacks in Africa, including urbanized blacks with secondary edu- 
cations, have obtained extremely low scores. Richard Lynn was able to 
assemble eleven studies in bis 1991 review of the literature. He esti- 
mated the median black African IQ to be 75, approximately 1.7 Stan¬ 
dard deviations below the U.S. overall population average, about ten 
points lower than the current figure for American blacks. 40 Where other 
data are available, the estimates of the black African IQ fall at least that 
low and, in seine mstances, even lower. 4 ' The IQ of “coloured” students 
in South Africa—of mixed racial background—has been found to be 
similar to that of American blacks. 48 

ln summary: African blacks are, on average, substantially below 
African-Americans in intelligence test scores. Psychometrically, there 
is little reason to think that these results mean anything different about 
cognitive functioning than they mean in non-African populations. For 
our purposes, the main point is that the hypothesis about the special cir- 
cumstances of American blacks depressing their test scores is not sub- 
stanriated by the African data. 


Is the Difference in Black and White Test Scores Diminishing! 

The answer is yes with (as usual) some qualifications. 

IQ Test Data. The most straightforward way to answer the question 
would be to examine the repeated administrations of the same IQ tests 
to comparable populations, but large, nationally representative IQ data 
are not produced every year (or even every decade). The NLSY data are 
among the most recent for a young adult population, and they have a 
B/W difference toward the high end of the ränge. The only post-1980 
study reporting black and white adult averages rhat we have found is 
the renorming of the Wechsler Adult Intelligence Scale (WA1S-R) in 
1981 in which the difference between blacks and a sample of whites 
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(that apparently did not try to discriminate hetween Larino and Anglo 
whites) was 1.0 Standard deviation . 40 

Recent data cm children teil opposite stories. ln a review of IQ tests 
of children conducted since 1980, Ken Vincent of the University 
of Houston reports results for four normative studies that showed 
a B/W difference of only seven IQ points tor the Ravens Standard Pro¬ 
gressive Matrices (SPM) and the Kaufman Assessment Rattery for Chip 
dren (K-ABC)P° Two other studies involving the Stanford-Binet IV 
found B/W differences cd ten points for children ages 7 to 1 1 and twelve 
points for children ages 2 to 6. 1,11 Qualifications must he attached to 
these findings. The B/W difference on the K-ABC normative sample 
has in particular been suhjected to reexamination suggesting thar rhe 
diminished gap largely reflected psychometric and Statistical artifacts. 1 ^ 1 
None the 1 es s, the data on children that Vincent reviews may he read as 
encouraging. The most impressive of the findings is the comparatively 
small B/W difterence of only seven IQ points on the Ravens SPM ad- 
ministered to 12-year-olds. This finding corresponds to jensens 1992 
study of black and white children in an upper-middle-class setting in 
which the difference on the Ravens SPM was similarly below the norm 
(a deficit corresponding to ten IQ points).^ 

ln contrast to Vincents optimistic conclusions, the NLSY shows a 
growing rauher than a shrinking gap in the next generation ot blacks 
and whites. As discussed in Chapter 15, the B/W difference hetween 
NLSY children is currently wider than the B/W difference separating 
their mothers. 

ACADHMiC APTITI.JDH AND ACHIEVEMENT TESTS. The most extensive ev- 
idenee ot a narrowing black-white gap can he found in longitudinal data 
front rhe National Assessment of Educarional Progress (NAEP), the 
American College Testing (ACT) examination, the SAT, a comparison 
of the 1972 and 1980 national high school surveys, and some statedevel 
achievement test data. We review theN AEP and the SAT here, and ot h- 
ers (which teil the same story) in Appendix 5. 

The National Assessment ot Educational Progress is an ongoing pro- 
gram, sponsored by the federal government to monitor the academic 
achievement ot the nation’s youth. It began in 1969, periodically test- 
ing 9-, 1 3-, and 17-year-olds in Science, mathematics, reading, and writ- 
ing in nationally representative samples. The table below shows the 
changes from the first round ot testing in 1969-1973 to the data for 


Reductions in the Black-White Difference on the 
National Assessment of Educational Progress 

White-Black Difference, in Change 

Standard Deviations 8 



1969-1973 

1990 


9'yeanokh 

Science 

1.14 

.84 

-.30 

Math 

.70 

.54 

-.16 

Reading 

.88 

.70 

-.18 

Average 

.91 

.69 

-.21 

1 3 'jear-olds 

Science 

.96 

.76 

-.20 

Math 

.92 

.54 

-.38 

Reading 

.78 

.40 

-.38 

Average 

.89 

.57 

-32 

17-year-o/d.s 

Science 

1.08 

.96 

-.12 

Math 

.80 

.42 

-.58 

Reading 

1.04 

.60 

-.44 

Average 

.97 

.66 

-.31 

Overaf/ average 

.92 

.64 

-.28 

Source; National Center lor Education Staiistics, 1991h. 


' The computations assume a Standard di 

eviatkm ot 50 . 



1990, expressed in Standard deviations. The “Change” column gives the 
earlier B/W difterence minus the later B/W difference, which is nega¬ 
tive if the gap is closing. The fourth component of the NAEP, a writing 
test, was inrroduced only in 1984, with replications in 1988 and 1990. 
Unlike all the others, it does not show a narrowing of the white-black 
gap (.46 SD in both 1984 and 1990) and is not included in the table. 

As the table indicates, black progress in narrowing the test score dis- 
crepancy with whites has been substantial on all three tests and across 
all of the age groups. The Overall average gap of .92 Standard deviation 
in the 1969-1973 tests had shrunk to .64 Standard deviation by 1990. 
The gap narrowed because black scores rose, not because white scores 
feil. Altogether, the NAEP provides an encouraging picture. 
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The first published breakdowns of SAT scores hy ethnicity appear for 
1976, when the downward trend in SAT scores narionwide alter 1963 
was nearing its bottom (see Chapter 18). From 1976 to 1993, the white- 
black gap in SAT scores narrowed from 1.16 to .88 Standard deviarion 
in the verbal portion of the test and from 1.27 to .92 Standard deviarion 
in the mathematies portion of the test. h41 Comparable narrowing has 
also brought black and white achievement test scores closer, as pro- 
sented in Appendix 3. Because the ethnic self-identification of SAT test 
takers contains some anomalies 0 and because the SAT pool is unrep- 
resentative of the general population, the numbers should be interpreted 
with caution. But even so, the SAT data indicate a narrowing gap. Black 
SAT test takers improved substantially more in scores than white SAT 
test takers, and neither the changes in the pool of test takers nor the 
welbadvertised national decline in SAT scores was responsihle, für rea- 
sons explained in the notes. ho1 

EXPLAININU THE: CONVERGENCE. Let us assume that during the past two 
decades black and white cognitive ability as measured by IQ has in fact 
converged by an amount that isconsistent with the convergence in ed- 
ucational aptitude measures such as the SAT or NAEP—a narrowing ot 
approximately .15 to .25 Standard deviation units, or the equivalent ot 
rwo to three IQ points overallT' 1 Why have the scores converged? The 
answer calls for speculation. 

We take for granred that individual variations in cognitive ability do¬ 
pend on hoth genes and environment (see Chapter 4)- ln a period 
as short as twenty years, environmental changes are likely to provide 
the main reason for the narrowing racial gap in scores. fvsl Real and im¬ 
portant though the problems ot the underclass are, and acknowledging 
that the underclass is disproportionately black, living conditions have 
improved for most Atrican-Americans since the 1950s—socially, eco- 
nomically, and educationally. 

Consider the schools that hlacks attend, for example. Some schools 
in the inner cities are worse than they were thirty years ago, but pro- 
port ionately few hlacks live in these worst-of-the-worsr areas. w 
Throughout the South and in much of the rest of the country, many 
black children as recently as the 1950s attended ramshackle schools 
with undertrained teachers and meager teaching materials. Any com- 
parison between the schools that most blac.ks attend now and the ones 
they attended in the 1950s favors Contemporary schools. Assuming that 


education affects cognitive capacity, the rising Investment in education 
disproportionately henefits the cognitive levels at the lower end of the 
socioeconomic spectrum. 

The argument can be repeated for puhlic Health. If nutrition, shelter, 
and health care affect intellectual development, then rising Standards 
of living are disproportionately going to show up in rising scores for the 
economically disadvantaged rather than for the upper classes. Fortravel 
and its educational henefits, the argument also applies. Not so long ago, 
many less advantaged people spent their lives within a few miles of their 
birthplaces. Today, Americans of nearly all walks of life crowd the in¬ 
te rstäte roads and the airports. Final ly, for that most Contemporary form 
of vicarious travel—the populär media—the leveling is still more dra- 
matic. The modern media can bring the world to everyone in ways that 
were once open only to the rieh. 

Because hlacks are shifted toward the lower end of the socioeconomic 
ränge, such improvements benefit them, on average, more than whites. 
If the improvements affect cognitive development, the hlack-white gap 
should have contracted. Beyond this socioeconomic leveling, there 
might also have been a leveling due to diminishing racism. The legacy 
of historic racism may still be taking its toll on cognitive development, 
but we must allow the possibility that it has tessened, at least for new 
general ions. This too might account for some narrowing of the hlack- 
white gap. 

Look INC. to THE Futljre. The question that remains is whether black 
and white test scores will continue to converge. If all that separates 
blacks from whites are environmental differences and if fertility 
pattems for different socioeconomic groups are comparable, there is 
no reason why they shouldn’t. The process would be very slow, 
however. If it continues at the pace ohserved over the last twenty 
years, then we could expect black and white SAT scores to reach 
equality sometime in the middle of the twenty-first Century, but linear 
extrapolations over such long periods are not worth muchQ 01 

If hlack fertility is loaded more heavily than white fertility toward 
knv-IQ segments of the population, then at some point convergence 
may be expect cd to stop, and the gap could even begin t o widen again. 
We take up the fertility issue in Chapter 15. A briet summary Statement 
concerning fertility patterns is that the news is not good. For now, the 
test score data leave open the possibility that convergence has already 
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stalled. For most of the tests we mentioned, black scores stopped rising 
in the mid-1980s. On the NAEP, the B/W gap actually increased from 
1986 to 1990 in all but one test group (the math test for 17-year-olds). 
On the SAT, black scores on both verbal and math parts were nearly 
flat for the five years ending in 1993, after suhstantial gains in the pre- 
ceding decade. On the ACT, however, black scores continued to rise al¬ 
ter 1986, albeit modestly.* 61 * 

One explanation for the stalled convergence on the N AEP and SAT 
is that American education stopped improving for everyone, hlacks in- 
cluded. This is consistent with the white experience on the SAT, where 
white scores have alsobeen nearly flat since the mid-1980s. But the logic 
is suspect. Just because a group at a higher mean stops improving does 
not imply that a group with a lower mean should also stop improving. 
On the contrary, pessimists can develop a case that the convergence of 
black and white SAT scores in the last two decades is symptomatic of 
what happens when education slows down toward the speed of the slow- 
est ship in the convoy. It may well be that education improves for stu- 
dents at the low end of the distribution but gets worse (or, more 
optimistically, improves less) for students at the top endJ 6 ^ lf that is the 
case, the gap between people at the low and high end of the distribution 
should narrow, but the narrowing will stop once the educational System 
completes its readjustment favoring less capable students. 

The narrowing black-white gap on the SAT looks consistent with 
some such explanation. 1631 Seen from one perspective, there is good news 
all along the Spectrum of test scores. From 1980 to 1993, the proportion 
of black test takers who scored in the 700s on the SAT-Verbal increased 
by 27 percent, for example. 1641 But such changes at the high end of the 
ränge of test scores mean little, because so small'& proportion of all black 
students were involvedJ 6 ^ 1 The real source of the black increase of 
twenty-three points in the average verbal test score from 1980 to 1995 
was a rise in the scores at the low end of the ränge. More than half (51 
percent) of the gain occurred because the proportion of black students 
scoring in the 200s dropped from 42 percent to 30 percent. 1661 In con- 
trast, less than 1 percent (0.4 percent) of the gain occurred because of 
the change in the proportion of black students scoring in the 700s. For 
the math test, 22 percent of the gain from 1980 to 1993 was accounted 
for by a drop in students scoring in the 200s; 4 percent of it was ac¬ 
counted for by an increase in students scoring in the 700s. 

Pessimists reading these data may think of an analogy with the in- 


creases in height that follow from better nutrition: Better nutririon helps 
raise the height. of children whose diets would otherwise have been in¬ 
adequate, but it does not add anything to the height of those who have 
Feen receiving a good diet already. 1671 Optimists may use the opposite 
sort of nutritional analogy: the experience of trying to lose weight. Even 
a successful diet has its plateaus, when the weight stubbornly stops Corn¬ 
ing oft for a while. A plateau is all that we are seeing in recent test data. 
Perhaps convergence will resume or even acceleratc in the near future. 

At the least, the optimists may say that it is too soon to pass judgment, 
and that seems the safest conolusion. As we reach the end of this dis- 
cussion of convergence, we can imagine the responses of readers of vary- 
ing persuasions. Many of you will be wondering why we have feit it 
necessary to qualify the good news. A smaller numher of readers who 
special ize in mental test ing may be wondering why we have given so 
much prominence to educational achievement trends and a Scattering 
of IQ results that may be psychometrically ephemeral. The answer for 
everyone is that predicting the future on this issue is little more than 
guesswork at this point. We urge upon our readers a similar Suspension 
of judgment. 

GENETICS, IQ, AND RACE 

This hrings us to the flashpoint of Intelligence as a public topic: the 
question of genetic differences between the races. Expert opinion, when 
it is expressed at all, diverges widely. In the 1980s, Mark Snyderman and 
Stanley Rothman, a psychologist and a political scientist, respectively, 
sent a questionnaire to a broad sample of 1,020 scholars, mostly acade- 
micians, whose specialties give them reason to be knowledgeable ahout 
IQ. ,66) Among the other questions, they asked, “Which of the following 
best characterizes your opinion of the heritability of the black-white dif- 
ference in IQ?” (emphasis in the questionnaire item). The answers were 
divided as follows: 

• The difference is entirely due to environmental Variation: 1 5 per¬ 
cent. 

• The difference is entirely due to genetic Variation: 1 percent. 

• The difference is a product of both genetic and environmental 
Variation: 45 percent. 
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• The data are insufficient to support any reasonable opinion: 24 
percent. 

• No response: 14 percent. 

The responses reveal the degree of uncertainty within the scientific Com¬ 
munity about where the truth lies. We have considered leaving the ge- 
netics issue at that, on grounds that no useful purpose is served by talking 
about a subject that is so inflammatory, so painful, and so far from reso- 
lution. We could have cited any numher of expert reassurances that ge- 
netic differences among ethnic groups are not worth worrying about. For 
example, a recently published textbook from which College students 
around the country are learning abotit intelligence States unequivocally 
that “there is no convincing direct or indirect evidence in favor of a ge- 
netic hypothesis of racial differences in IQ.” 60 Stephen J. Gould, whose 
Mismeasure ofMan so successfully cetnented the received wisdom about 
IQ in the media, expresses this view as confidently and more eloquently. 
“EquaUty [of the races] is not given a priori,” he once wrote in his col- 
umn for Natural History magazine. u lt is neither an ethical principle 
(though equal treatment may be) nor a Statement about norms of social 
action. It just worked out that way. A hundred different and plausible 
scenariosfor human history would have yielded other results (and moral 
dilemmas) of enormous magnitude. They just didn’t happen.”'" Hegoes 
on to make three arguments. First, the very concept oi race is illegiri- 
mate, given the extensiveness of interbreeding and the imprecise nature 
of most of the traits that people think of as being “racial.” Second, the 
division of races is recent, occurring only in the last tens or perhaps hun- 
dreds of thousands of years, limiting the amount of time that groups of 
humans could have taken separate evolutionary paths. Third, develop¬ 
ment« in genetics demonstrate that the genetic differences among hu¬ 
man beings are minor. “We now know that our usual metaphor of 
superficiality—skindeep—is literally accurate,”Gould writes.' 1 He con- 
cludes: “Say itfive times before breakfast tomorrow; more important, tin- 
derstand it as the center of a network of implication: ‘Human equality 
[i.e., equality among the races] is a contingent fact of history.’ ” l " 

Our difficulty with this position is not that Gould (or others who 
make similar arguments) is wrong about the blurred lines between the 
races, or about how long the races have been separated, or about the 
number of genes that are racially distinctive. All his facts can be true, 
and yet people who call themselves Japanese or Xhosa or Caucasians or 


Maori can still differ intellectually for genetic reasons. We may call them 
“ethnic groups” instead of races if we wish—we too are more comfort- 
able with ethnic, because of the blurred lines—but some ethnic groups 
nonetheless differ genetically for sure, otherwise they would not have 
differing skin colors or hair textures or muscle mass. They also differ in¬ 
tellectually on the average. The questkm remaining is whether the in- 
tellectual differences overlap the genetic differences to any extent. 

Our reason for confronting the issue of genetic cognifive differences 
is not to quarrel with those who deny them. If the question of genetic 
differences in cognitive ability were something that only professors 
argued about among themselves, we would happily ignore it here. We 
cannot do so, first because in the public discussion of genes and intelli- 
gence, no bürden of proot at all is placed on the innumerable public 
commentators who claim that racial differences in intelligence are 
purely environmental. This sometimes leads to a next Statement: that 
the differences are therefore inauthentic and that public policy must be 
measured against the assumption that there are no genuine cognitive 
differences between the races.The assumption of genetic cognitive 
equality among the races has practical consequences that require us to 
confront the assumption directly. 

Second, we have become convinced that the topic of genes, intelli¬ 
gence, and race in the late twentieth Century is like the topic of sex in 
Victorian England. Publicly, there seems to be nothing to talk about. 
Privately, people are fascinated by it. As the gulf widens between pub¬ 
lic discussion and private opinion, confusion and error flourish. As it 
was true of sex then, so it is true of ethnic differences in intelligence 
now: Taboos breed not only ignorance but misinformation. 

The dangers of the misinformation are compounded by the nature 
of the Contemporary discussion of race. Just beneath the surface of 
American life, people talk about race in ways that bear little resem- 
blance to the politically correct public discussion. Conducted in the 
workplace, dorm rooms, taverns, and country clubs, by people in every 
ethnic group, this dialogue is troubled and often accusatory. The Un¬ 
derground conversation is not limited to a racist minority. It goes on 
everywhere, and we believe is increasingly shaped by privately held be- 
liefs about the implications of genetic differences that could not stand 
open inspection. 

The evidence ahout ethnic differences can be misused, as many peo¬ 
ple say to us. Some readers may feel that this danger places a moral pro- 
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hibition against examining ehe cvidcncc for genetic facrors in puhiie. 
We di sagree, in part because we see even great er dangers in the current 
gulf between public pronouncements and private heliefs. And so, for 
better or worse, here are the major Strands of current thinking about the 
role of genes in cognitive differences between races . 1 ' 41 

Heritability and Group Differences 

A good place to Start is by correcting a common contusion about the 
role of genes in individuals and in groups. As we discussed in Chapter 
4, scholars accept that IQ is substantially heritahle, somewhere between 
40 and 80 pereent, meaning that much of the ohserved Variation in IQ 
is genetic. And yet this Information teils us nothing for sure about the 
origin of rhe differences between races in measured intelligence. This 
point is so hasic, and so commonly misunderstood, that it deserves env 
phasis: That a trau is genetically transmitted in individuals does not rnean that 
group differences in that trait are also genetic in origin. Anyone who douhts 
this assertion may take two handfuls of genetically identical seed corn 
and plant one handful in Iowa, the other in the Mojave Desert, and let 
nature (i.e., the environment) take its courseT The seeds will grow in 
Iowa, not in the Mojave, and the result will have norhing to do with 
genetic differences. 

The environment for American blacks has heen closer to the Mojave 
and the environment for American whites has been closer to Iowa. We 
may apply this general observation to the availahle data and see where 
the results lead. Suppose that all the ohserved ethnic differences in 
tested intelligence originate in some mysterious environmental differ¬ 
ences—mysterious, because we know from material already presenred 
that socioeconomic factors cannot be much of the explanation. We für- 
ther stipulate that one Standard deviation (fifteen IQ points) separates 
American blacks and whites and that a fifth of a Standard deviation 
(three IQ points) separates East Asians and whites. Finally, we assume 
that IQ is 60 pereent heritahle (a middle-ground estimate). Given these 
Parameters, how different would the environments for rhe three groups 
have to be in order to explain the ohserved difference in these scores? 

The ohserved ethnic differences in IQ could be explained solely by 
the environment if the mean environment of whites is 1.58 Standard 
deviations better than the mean environment of blacks and .32 Stan¬ 
dard deviation worse than rhe mean environment for East Asians, when 


environments are measured along the continuum of their capacity to 
nurture intelligence. I7hl Let s state these conclusions in percentile terms: 
The average environment of blacks would have to be at the 6rh per¬ 
centile of the distribution of environments among whites, and the av¬ 
erage environment of East Asians would have to be at the 63rd percentile 
of environments among whites, for the racial differences to be entirely 
environmental. 

Environmental differences of this magnitude and pattern are im¬ 
plausible. Recall further that the B/W difference (in standardized units) 
is smallest at the lowest socioeconomic levels. Why, if the B/W differ¬ 
ence is entirely environmental, should the advantage of the “white” en¬ 
vironment compared to the “black” he greater among the better-off and 
better-educated blacks and whites? We have not been able to think of 
a plausible reason. An appeal to rhe effects of racism to explain ethnic 
differences also requires explaining why environments poisoned by dis- 
crimination and racism for some other groups—against the Chinese or 
the Jews in some regions of America, for example—have left them with 
higher scores than the national average. 

Environmental explanations may successfully circumvenr these 
Problems, hur the explanations have to he formulated rather than 
simply assumed. Our initial objective is to warn readers who come to 
the discussion with firmly held opinions on either side. The herit- 
ahility of individual differences in IQ does not necessarily mean that 
ethnic differences are also heritahle. But those who think that ethnic 
differences are readily explained hy environmental differences haven’t 
been tough-minded enough about their own argument. At this 
complex intersection of complex factors, the easy answers are un- 
satisfactory ones. 

Reasons for Thinking that Genetic Differences M ight Be Involved 

Now we turn to some of the more technical arguments, beginning with 
those that argue for some genetic component in group differences. 

Profile Differences Between Whites ann East Asians. Races 
differ not just in average scores hut in the profile ot intellectual 
capacities. A full-scale IQ score is the aggregare of many subtests. 
There are thirteen of them in the Wechsler Intelligence Scale for 
Children (WISC-R), for example. The mosr hasic division of the 
subtests is into a verbal IQ and a performancc IQ. ln white samples, 
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the verbal and performance IQ subscores tend to have ahout rhe same 
mean, because IQ tests have been standardized on predominantly 
white populations. But individuals can have imbalances between 
these two IQs. People with high verbal abilities are likely to do well 
with words and logic. In school they excel in history and literature; in 
choosing a career to draw on those talents, they tend to choose law or 
journalism or advertising or poiitics. ln contrast, people with high 
performance IQs—or, using a more descriptive phrase, “visuospat ial 
abilities”—are likely to do well in the physical and bioiogical 
Sciences, mathematics, engineering, or other subjects that demand 
mental manipulation in the three physical dimensions or the more 
mimerous dimensions of mathematics. 

East Asians living overseas score about the same or slightly lower than 
whites on verbal IQ and substantially higher on visuospatial IQ. Even 
in the rare studies that have found Overall Japanese or Chinese IQs no 
higher than white IQs (e.g., the Stevenson study ol Japanese, Taiwanese, 
and Minnesotans mentioned earlier),' 7 the discrepancy between verbal 
and visuospatial IQ persists. For Japanese living in Asia, a 1987 review 
of the literature demonstrated without much question that the verbal- 
visuospatial differenee persists even in examinations that have been 
thoroughly adapted to the Japanese language and, indeed, in tests de- 
veloped by the Japanese rhemselvesT A study of a small sample of Ko- 
rean infants adopted into white families in Belgium found rhe familiär 
elevated visuospatial scores/ 9 

This finding has an echo in the United States, where Asian-Ameri¬ 
can students abound in engineering, in medical schools, and in gradu- 
ate programs in the Sciences, but are scarce in law schools and graduate 
programs in the humanities and social Sciences. Most people reflexively 
assume that this can he explained by language diffetences. People who 
did not speak English as their first language or who grew up in house- 
holds where English was not the language of choice choose professions 
that are not so dependent on fluent English, we often hear. But the ex- 
planation becomes less credible with every passing year, Philip Vernon, 
after reviewing the evidence on Asian-Americans, concluded that un- 
familiarity with the English language and American culture is a plausi¬ 
ble explanation only for the results of the early studies. Contemporary 
studies of Asian-Americans who are thoroughly acculturated also show 
the typical discrepancy in verbal and visuospatial abilities. American 
Indians and Inuit similarly score higher visuospatially than verbally; 


their ancestors migrated to the Americas from East Asia hundreds of 
centuries ago. The verbal-visuospat ial discrepancy goes deeper than 
linguistic background. 

Vetnon’s overall appraisal was that the mean Asian-American IQ is 
about 97 on verbal tests and about 110 on visuospatial tests. 1 * 11 Lynn’s 
1987 review of the IQ literature on East Asians found a median verbal 
IQ of 98 and a median visuospatial IQ of 106. !s2! As of 1993, for Asian- 
American students who reported that English was the first language they 
leamed (alone or with another language), the Asian-American SAT 
mean was .2 1 Standard deviation above the national mean on the ver¬ 
bal test and .43 Standard deviation above the national mean on the math 
test. Converted to an IQ metric, this amounts to a 3.3 point elevation 
of mathematical scores over verbal scores for the high IQ Asian-Amer¬ 
ican population that takes the SAT. ,si| 

Why do visuospatial abilities develop more than verbal abilities in 
people of East Asian aneestry in Japan, Hong Kong, Taiwan, mainland 
China, and other Asian countries and in the United States and else- 
where, despite the differences among the cultures and languages in all 
those countnes* Any simple socioeconomie, cultural, or linguistic ex- 
planarion is out of the question, given the diversity of living conditions, 
native languages, edueational resources, and cultural practices experi- 
enced by Hong Kong Chinese, Japanese in Japan or the United States, 
Koreans in Korea or Belgium, and Inuit or American Indians. We are 
not so rash as to assert that the environment or the culture is whnlly ir¬ 
relevant to the development of verbal and visuospatial abilities, but the 
common genetic history of racial East Asians and their North Ameri¬ 
can or European descendants on the onc hand, and the racial Europeans 
and their North American descendants, on the other, cannot plausibly 
be dismissed as irrelevant. 

Profile Differences Bftwffn Whites and Blacks. Turning now to 
blacks and whites (using these terms to refer exclusive ly to Amerieans), 
ability profiles have also been important in und erstand ing the na tu re, 
and possible genetic component, of group differences. The argument has 
been developing around what is known as Spmrmans hypothescsQ 41 This 
hypothesis says that if the B/W differenee on test scores reflects a real un- 
derlying differenee in the general mental ability, g, rhen rhe size of the 
B/W differenee will he related to the degree to which the test is saturated 
withg. |M| In other words, thebettera test measures g, the largerthe black- 
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white difference will he. Arthur Jensen began to expiore this possibility 
when he looked at the pattern of subtest scores on the WISC-R, taking 
advantage of the fact that the WISC-R has thirteen subtests, each mea- 
suring a somewhat different skill. Converting their Statistical procedures 
into a more easily understood form, here is the logic of what Arthur 
Jensen and his coauthor, Cyril Reynolds, didT 

On average, low-SES whites get lower test scores than high-SES 
whites. Rut suppose you were to go through a large set of white test scores 
from a low-SES and a high-SES group and pull out everyone with an 
overall IQ score ot, say, 105. Now you have identical scores but very dif- 
ferent SES groups. The question becomes, What does the pattem of sub- 
test scores look like? The answer is, The same. Once you equalize the 
overall IQ scores, low-SES and high-SES whites also had close-to-iden¬ 
tical mean scores on the individual subtests. 

Now do the same exercise with blacks and whites. Again, let us say 
that you pull all the tests with a full-scale IQ score of exactly 105. Again, 
you examine the scores on the subtests. But this time the pattern of sub- 
test scores is not the same for blacks and whites, even though the sub¬ 
tests add up to the identical overall score. |h?1 Despite identical overall 
scores, whites are characteristically stronger than blacks on the subtests 
involving spatial-perceptual ability, and blacks are characteristically 
stronger than whites in subtests such as arithmetic and immediare mein- 
ory, both of which involve retention and retrieval of informationT As 
Jensen and Reynolds note, the pattern of subtest differences between 
whites and blacks differs sharply from the “no differences” result 
associated with SES. This directly contradicts the hypothesis that the 
B/W difference reflects primarily SES differences.^ What accounts for 
the different subtest profiles? Jensen and Reynolds proceeded todemon- 
srrare that the results are consistent with Spearman’s hypothesis. Whites 
and blacks dif fer more on the subtests most high ly correlated with g, less 
on those leasr correlated with g. 

Since that initial study using the WISC-R, Jensen has been assem- 
bling studies that permi t further tests of Spearman’s hypothesis* He con- 
cluded from over a dozen large and representative samples ot blacks and 
whites 90 that “Spearman’s hypothesis has been borne out significantly 
by every study (i.e., 13 out of 13) and no appropriate data set has yet 
been found that contradicts Spearman’s hypothesis.” 91 There appears to 
be no dispute with his summary of the facts. lt should be noted that not 
all group differences hehave similarly. For example, deaf children often 


get lower test scores than hearing children, but the size of the difference 
is not positively correlated with the tesr’s loading on gC The phenoin- 
enon seems peculiarly concentrated in comparisons of ethnic groups. 

jensen’s most recent work on Spearman s hypothesis uses reaction 
time tests instead of traditional mental tests, hypassing many of the usual 
objections to intelligence test questions. Once again, the moreg-loaded 
the activiry is, the [arger the B/W difference is, on average. 91 Criticscan 
argue that the ent i re enterprise is mean ingiess because g is me an ing¬ 
iess, but the hypothesis of a correlation between the magnitude of the 
g-loading of a test and the magnitude of the black-white difference on 
that test has been confirmed. 1 ^ 1 

How does the confirmation of Spearman’s hypothesis bear on the ge- 
nctic explanation of ethnic differences? In plain though somewhat im- 
precise language: The broadest conception of intelligence is embodied 
in g. Anything other than g is either a narrower cognitive capacity or 
measuremenr error. Spearman’s hypothesis says in effect that as mental 
measurement focuses most specifically and reliahly on g, the observed 
black-vvhire mean difference in cognitive ability gets larger. !9S1 At the 
same time, g or other broad measures of intelligence typicallv have rel¬ 
ative ly high levels of beritabiliry. 1 ^ 1 This does not in itself Jemand a ge- 
netic explanation of the ethnic difference, but by asserting that “the 
bette? the test, the greater the ethnic difference,” Spearman’s hypothe¬ 
sis undercuts many of the environmental explanations of the difference 
that re ly cm the proposition (again, simplifying) that the apparent black- 
white difference is the result of had tests, not good ones. 

Arguments Agamst a Genetic Explanation 

The ubiquitous Arthur Jensen bas also published the clearest evidence 
that the disadvanraged environment of some blacks has depressed their 
test scores. He found that in black families in rural Georgia, the elder 
sihling typically has a lower IQ than the younger. 9 ' The larger the age 
difference is between the siblings, the larger is the difference in IQ. The 
implication is that something in the rural Georgia environment was de- 
pressing the scores of black children as they grew older. |9S| In neither the 
white families of Georgia, nor white or black families in Berkeley, 
California, are there comparable signs of a depressive effect of the 
environment. 

But demonstrating that environment can depress cognitive develop- 
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ment does not prove that the entire B/W difference is environmental, 
and in this lies an asymmetry between the contending parties in the de- 
hate. These who argue that genes might be implicated in group differ¬ 
ences do not try to argue that genes explain everything. Those who 
argue against them—Leon Kamin and Richard Lewontin are the most 
prominent—typically deny that genes have anything to do with group 
differences, a much more ambitious proposition. 

CüNFRONTIN () SPEARMAN'S HyPOTHHSLS. If one is to make this case 
against a genetic factor on psychometric grounds, the data supporting 
Spearmans hypothesis must be confronted. There are two ways to do 
so: dispute the fact itself or gram; the tact but argue that it does not 
mean what Jensen says it does. 

The most searching debate about Spearmans hypothesis was con- 
ducted in a journal that puhlishes both original scholarly works and com- 
mentaries on them, Behavioral and Bmin Sciences, where, in two separate 
issues in the latter 1980s, thirty-six experts in the relevant fields com- 
mented on Jensen’s evidenceT A number of comments were favorahle 
and provided further support for Jensen’s conclusion. Orhers were criti- 
cal, for reasons that varied from the philosophical (research into such 
hurtful issues is not useful) to the highly Technical (were Jensen’s results 
the result of varying reliabi Litios amt mg the tests?)- We summarize them 
in the notes, but the striking feature was that no commentator was able 
to d ispute the empirical claim that the racial gap in cognitive performance 
scores tends to be larger on tests or activities that draw most on g. |10C| 

Several years after the exchange on Spearmans hypothesis in Be¬ 
havioral and Bram Sciences , Jan-Eric Gustafsson presented some data 
finding a considerably smaller correlation than Jensen and others do be- 
tween g loading and B/W differences on a group of subtests. 101 It is not 
clear why Gustafsson obtained these atypical results, but, as of this writ- 
ing, they are still atypical. We have found no others for representative 
groups of blacks and whites. Our own appraisal of the Situation is that 
Jensen’s main contentions regarding Spearmans hypothesis are intaet 
and constitute a major challenge to purely environmental explanations 
of the B/W difference. 

CULTURAL EXPLANATIONS. Another approach has been taken by Jane 
Mercer, a sociologist and the developer of the System of Multicultural 
Plural istic Assessment (SOMPA). Tests are artifacts of a culture, she 
argues, and a culture may not diffuse equally into every household and 


community. ln a heterogeneous society, subcultures vary in ways that 
inevitably affect scores on IQ tests. Fewer books in the home means 
less exposure to the material that a vocabulary subtest measures; the 
varying ways of socializing children may influence whether a child 
acquires the skills, or a desire for the skills, that tests test; the 
“common knowledge” that tests supposedly draw on may not be 
common in certain households and neighborhoods. 

So far, this sounds like a Standard argument about cultural bias, and 
yet Mercer accepts the generalizations that we discussed earlier about 
internal evidence of bias. 10,1 ' She is not claiming that less exposure to 
books means that blacks score lower on vocabulary questions but do as 
well as whites on culture-free items. Rather, she argues, the effects of 
culture are more diffuse. Her argument may be seen as a variant of the 
“uniform background radiation” hypothesis that we discussed earlier. 

Furthermore, she points out, strong correlations between home or 
community life and IQ scores are readily found. In a study of 180 Latino 
and 180 non-Latino white elementary school children in Riverside, 
California, Mercer examined eight sociocultural variables: (1) mother’s 
participation in formal organizations, (2) living in a segregated neigh- 
borhood, (3) home language level, (4) socioeconomic Status based on 
occupation and education of head of household, (5) urbanization, (6) 
mother’s achievement values, (7) home ownership, and (8) intaet bio- 
logical family. She then showed that once these sociocultural variables 
were taken into account, the remaining correlation between ethnic 
group and IQ among the children feil to near zero. IOi 

The problem with this procedure lies in determining what, in fact, 
these eight variables control for: cultural diffusion, or genetic sources of 
Variation in intelligence as ordinarily understood? Recall that we 
pointed out earlier that Controlling for socioeconomic Status typically 
reduces the B/W difference by about a rhird. To the extern: that parental 
socioeconomic Status is produced hy parental IQ, Controlling for so¬ 
cioeconomic Status Controls for parental IQ. One obvious criticism of 
SOMPA is that it broadens the scope of the control variables to such 
an extent that the procedure becomes meaningless. After the correla¬ 
tions between the eight sociocultural variables and IQ are, in effect, set 
to zero, little difference in IQ remains among her ethnic samples. But 
what does this mean?The obvious possihility is that Mercer has demon- 
strated only that parents matched on IQ will produce children with sim- 
ilar IQs—not a startling finding. 
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Mercer points out that the samples differ on the sociocultural vari¬ 
ables even after Controlling for IQ. The substantial remaining correla- 
tions indicate that “important amounts of the variance in sociocultural 
characteristics [are] unexplained by IQ,” 10 * 4 evidence, she says, that they 
may be treated as substantially independent of IQ. 110,1 Rur they are, in 
fact, not independent of IQ. They remain correlated. Her basic coiv 
clusion that “there is no justification for ignoring sociocultural factors 
when interpreting between-group differences in IQ” seems to us uiv 
challengeable. 106 In the next chapter, we will present other examples of 
ethnic differences in social behavior that persist after Controlling for IQ. 
But to conclude that genetic differences are ruled out by her analysis is 
unwarranted, because she cannot demonstrate that a familys sociocul- 
tural characteristics are independent of their IQ- 107 

Scholars of Jensens school point to a number of other diff iculties wirb 
Mercer’s Interpretation. When she concludes that cultural diffusion ex- 
plains the black-white difference, the data she uses show the familiär 
pattern of Spearman’s hypothesis: The more a test loads on g, the grearer 
is the B/W difference. 10h Why should cultural diffusion manifest itself 
in such a patterned way ? Her appeal to sociocultural factors does not ex- 
plain why blacks score lower on back ward digit span than forward; why 
in Chronometrie tests, hlack movement time is faster, but reaction time 
slower, than among whites; or why the B/W difference persists on non¬ 
verbal tests such as the Ravens Standard Progressive Matrices. Ir is also 
not explained why, if the role of European white cultural diffusion (or 
the lack of it) is so important in depressing black test performance, it 
has heen so unimportant for Asians. 

A number of authors besides Mercer have advanced theories of cul¬ 
tural difference, often treated as parr of the “cultural bias” argument but 
asserting in more sweeping fashion that cultures differ in ways that will 
be reflected in test scores. In the American context, Wade Boykin is 
one of the most prominent academic advocates of a distinctive hlack 
culture, arguing that nine interrelated dimensions put blacks at odds 
with the prevailing Eurocentric. model. Among them are spirituality 
(blacks approach life as “essentially vitalistic rather than mechanistic, 
with the conviction that non-material forces influence people’s every- 
day lives”); a belief in the harmony between humankind and nature; an 
emphasis on the importance of movement, rhythm, music, and dance 
“which are taken as central to psycholog ical health”; personal styl es that 
he characrerizes as “verve” (high levels of Stimulation and energy) and 


“affect” (emphasis on emotions and expressiveness); and “social time 
perspective,” which he defines as “an orientation in which time is 
treated as passing through a social space rather than a material one.” 109 
The notes reference a variety of other authors who have made similar 
arguments. 110 All, in different ways, purport to explain how large B/W 
differences in test scores could coexist with equal predictive validity of 
the test for such things as academic and job performance and yet still 
not be based on differences in “intelligente,” broadly defined, let alone 
genetic differences. 

John Ogbu, a Berkeley anthropologist, has proposed a more specific 
version of this argument. He suggests that we look at the history of var- 
ious minority groups to understand the sources of differing levels of in- 
tellectual attainment in America. He distinguishes three types of 
minorities: “autonomous minorities” such as the Amish, Jews, and Mor- 
mons, who, while they may be victims of discrimination, are still within 
the cultural mainstream; "immigrant minorities,” such as the Chinese, 
Filipinos, Japanese, and Koreans within the United States, who moved 
voluntarily to their new societies and, while they may hegin in menial 
jobs, compare themselves favorably with their peers back in the home 
country; and, finally, “castelike minorities,” such as black Americans, 
who were involuntary immigrants or otherwise are consigned from birth 
to a distinctively lower place on the social ladder. 111 Ogbu argues that 
the differences in test scores are an outcome of this historical distinc- 
tion, pointing to a number of castes around the world—the untouch- 
ahles in India, the Buraku in Japan, and Oriental Jews in Israel—that 
have exhibited comparable problems in educarional achievement de- 
spite being of the same racial group as the majority. 

Tue Flynn EFFEtrT. Indirect support for the proposition that the observed 
B/W difference could be the result of environmental factors is provided 
by the worldwide phenomenon of rising test scores. 112 We call it “the 
Flynn effect” because of psychologist James Flynns pivotal role in focus- 
ing attention on it, but the phenomenon itself was identified in the 
1930s when testers began to notice that IQ scores often rose with every 
successive year after a test was first standardized. For example, when the 
Stanford-Binet IQ was restandardized in the mid- 1930s, it was observed 
that individuals earned lower IQs on the new tests than they got on the 
Stanford-Binet that had been standardized in the mid-1910s; in other 
words, getting a score of 100 (the population average) was harder to do 




308 The National Gontext 


Efhmc Differenccs in Cognitive Abüity 309 


on the later test. 113 This meant that the average person could answer 
more itemson the old test than the new test. Most of the ehange has been 
concentrated in the nonverbal portions of the tests. 

The rendency for IQ scores to dritt upward as a function of years since 
Standardization has now been substantiated, primarily by blynn, in tnany 
countries and on many IQ tests besides the Stanford-Riner. 114 ln some 
countries, the upward drift since World War II has been as much as a 
point a year for some spans of years. The national averages have in fact 
changed by amounts that are comparahle to the fifteen or so IQ points 
separat ing whites and blacks in America. To pur it a not her way, on the 
average, whites today may differ in IQ front whites, say, two generations 
ago as much as whites today differ from blacks today. Given their size 
and speed, the shifts in time necessarily have been due more to changes 
in the environment than to changes in the genes. 

The question then arises: Couldn’t the mean of blacks move 15 points 
as well through environmental changes.'’ There seems no reason vvhy 
not—hut also no reason to believe that white and Asian means can be 
made to stand still white the Flynn effec.t works its magic. 

There is a further question to answer: Does a 1 5-poinr IQ difference 
between grandparents and their grandchildren mean that the grand- 
children are 15 points smarter? Some experts do not believe that the 
rise is wholly, perhaps not even partly, a rise in intelligence hut in the 
narrower skills involved in intelligence test taking per se; m others 
believe that at least some of rise is in genuine intelligence, perhaps 
owing to the improvements in public education (by the schools and the 
media), health care, and nutrition. There is evidence that the rise in 
scores may be due to a contraction in the distribution of test scores 
in the population at large, with rnost of the shrinkage in the bottom half 
of the distribution. 110 ln large-scale studies of the Danish population, 
virtually all of the upward drift in intelligence test scores is accounted 
for by the rising performances of the lower half of the distribution. 11 " 
The data we presented earlier on the rise in SAT scores by American 
blacks are consistent with this story. In general, egalitarian modern 
societies draw the lower tail of the distribution closer to the mean and 
therehy raise the average. 11101 These findings accord with everyday 
experience as well. Whether one looks at the worlds of Science, litera- 
ture, politics, or the arts, one does not get the impression that the top 
of the IQ distribution is filled with more suhtle, insightful, or powerful 
intellects than it was in out grandparents’ day. 


Whatever we discover about the reasons for the upward drift in the 
mean of the distribution of test scores, two points are clear. First, a rapid 
rise in intelligence does not plausibly Stretch far into either the past or 
the tu ture. No one is suggesting, for example, that the IQ of the aver¬ 
age American in 1776 was 30 or that it will be 150 a Century from 
now. |lw ' The rising trend in test scores may already be leveling off in 
some coiintries. 1 “ 0 Second, at any point in time, it is one’s position in 
the distribution that has the most signifieant implications for social and 
economic Ute as we know it and also for the positionof one s children. 11211 

Flynn suggests that the intergenerational ehange in IQ has more to 
do with a shifting Unk between IQ scores and the underlying trait of In¬ 
telligence than with a ehange in intelligence per sc. 1122 ' Even so, the in- 
stahility of test scores across generations should caution against taking 
the current ethnic differenccs as etched in stone. There are things we 
do not yet understand about the relation between IQ and intelligence, 
which may be relevant for oomparisons not just across times but also 
across cultures and races. 

Rac:i Al. Anuistry. Just over 100 families with adopted children of 
white, black, and mixed racial ancestry are he ing studied in an ongoing 
analysis of the eftects of being raised by white adopting parents of mid¬ 
dle or higher social Status. 123 This famous transracial adoption study by 
psychologists Sandra Scarr and Richard Weinberg is the most compre- 
hensive attempt yet to separate the effects of genes and ot family envi¬ 
ronment on the cognitive development ot American blacks and whites. 
The first reports (when the children were about 7 years old) indicated 
that the black and interracial children had IQs of about 106, well above 
the national black average or the black average in Minnesota, where the 
samples were drawn, This result pointed to a considerable impact of the 
home serting on intelligence. However, a racial and adoptive ordering 
on IQ existed even in the first follow-up: The mean IQs were 117 for the 
biological children of white parents, 112 for the white adoptive children, 
109 for the adopted children with one black and one white or Asian par- 
ent, and 97 for the adopted children with two black parents. ,1 ^' Alto- 
gether, the data were important and interesting but not decisive 
regarding the source of the B/W difference. They could most easily have 
been squared with a theory that the B/W difference has both gerietic and 
environmental elements in it, but, wirb considerable straining, could 
perhaps have been stretched to argue for no genetic influence at all. 
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A follow-up a decade later, with the children in adolescence, does 
not favor the no-genetics case. 125 The new ordering of IQ means was 
109 for the biological children of white parents, 106 tor the white adop- 
tive children, 99 for the adopted children with one black parent, and 
89 for the adopted children with two black parents. 11261 The mean of 89 
for adopted children with two black parents was slightly above the na¬ 
tional black mean but not above the black mean for the North Central 
United States. The bottom line is that the gap between the adopted 
children with two black parents and the adopted children with two 
white parents was seventeen points, in line with the B/W difference cus- 
tomarily observed. Whatever the environmental impact may have been, 
it cannot have been large. 

Scarr and Weinberg continue to argue that the results are consistent 
with some form of mixed gene and environmental source of the B/W 
difference, which seems to us the most plausible conclusion. 127 Bur 
whatever the final Consensus about the data may be, the debate over the 
Minnesota transracial adoption study has shifted from an argument 
about whether the environment explains all or just some of the B/W 
difference to an argument about whether it explains more than a triv¬ 
ial part of the difference. 

Several smaller studies bearing on racial ancestry and IQ were well 
summarized almost two decades ago by Loehlin, Lindzey, and Spuhler. 12 "' 
They found the balance of evidence tipped toward some sort of mixed 
gene-environment explanation of the B/W difference without saying 
how much of the difference is genetic and how much environmental. 11M 
This also echoes the results of Snyderman and Rothman’s survey of Con¬ 
temporary specialists. 


The German Story 


One of the intriguing studies arguing against a large genetic component to 
IQ differences came about thanks to the Allied occupation of Germany 
following World War II, when about 4,000 iUegitimate children of mixed 
racial origin were born to German women. A German researcher tracked 
down 264 children of black servicemen and constructed a comparison 
group of 83 iUegitimate offspring of white occupation troops. The results 
showed no overall difference in average IQ. 128 The actual IQs of the fa- 
thers were unknown, and therefore a variety of selection factors cannot he 
ruled out. The study is inconclusive but certainly consistent with the Sug¬ 
gestion that the B/W difference is largely environmental. 


But dissenting voices can be heard m the academic world. For ex- 
arnple, a well-known book, Not in Our Genes, by geneticist Richard 
Lewontin and psychologists Steven Rose and Leon Kamin, criticizes 
anyone who even suggesrs that there may be a genetic component to 
the B/W difference or who reads the data as vve do, as tipping toward a 
mixrure of genetic and environmental influences. ni How can they do 
this? Mostly by emphasizing those aspeers ot the data that suggest envi¬ 
ronmental influences, such as the correlations between the adopting 
parents’ IQs or educational levels and rhe IQs ot their black adopted 
children in rhe Minnesota study from the first follow-up (the book was 
published betöre the second follow-up). But they have nothing to say 
about the aspects that are consistent with genetic influence, such as the 
even karger correlations between the educational level of either the bi¬ 
ological mothers or tathers and the IQs of their adopted-away black chil¬ 
dren. Although Lewonrin, Rose, and Kamin do not say it in so many 
words, their argument makes sense if it is directed at the claim that the 
B/W difference is entirely genetic. It does little to elucidare the ongoing 
scientific inquiry into whether rhe difference has a genetic component. 

We have touched on only rhe highlights of the arguments on both sides 
of the genetic issue. One main fopic we have left untouched involves 
the inalleabiliry of Intelligence, with two extremes of thought: that in- 
telligence is remarkahly unmalleable, which undercuts environmental 
arguments in general and cultural ones in particular, and that intelli- 
gence is highly malleable, supporting those same arguments. Because 
the malfeahility of intelligence is so critical a policy issue, it deserves a 
chapter of its own (Chapter 17). 

RETHINKING ETHNIC DIFFERENCES 

If the reader is now convinced that either the genetic or environmen¬ 
tal explanation has won out to the exelusion of the other, we have not 
done a sufficiently good job of presenting one side or the other. It seems 
highly likely to us that both genes and the environment have someching 
to do with racial differences. What might the mix he? We are resolutely 
agnostic on that issue; as far as we can determine, the evidence does not 
yet justify an estimate. 

We are not so naive to think that making such Statements will do 
much good. People find it next to impossible to treat erhnic differences 
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with detachment.. That there are understandable reasons for this only 
increases the need for thinking clearly and wich precision ahout what is 
and is not important. In particular, we have found that the genetic as- 
pect of ethnic differences has assumed an overwhelming importance. 
One Symptom of this is that while this book was in preparation and re- 
gardless of how we described it to anyone who asked, it was assumed 
that the book’s real subject had to he not only ethnic differences in cog- 
nitive ability but the genetic source ofthose differences. It is as if peo- 
ple assumed that we are faced with rwo alternatives: either (1) the 
cognitive difference between blacks and whites is genetic, which entaiLs 
unspoken but Jreadful consequences, or (2) the cognitive difference be¬ 
tween blacks and whites is environmental, fuzzily equated with some 
sort of culturai bias in IQ tests, and the difference is therefore tempo- 
rary and unimportant. 

But those are not the only alternatives. They are not even alterna¬ 
tives at all. The major ethnic differences in the United States are not 
the result of biased tests in the ordinary sense of rhe term. They may 
well include some (as yet unknown) genetic component, hut nothing 
suggests that they are entirely genetic. And, mosr important, it matters 
little whether the genes are involved at all. 

We have already explained why the bias argument does not readily 
explain the ethnic differences and also why we say that genes may be 
part of the story. To show why we believe that it makes next to no dif- 
ference whether genes are part of the reason for the ohserved differences, 
a thought experiment may help. lmagine that tomorrow it is discovered 
that the B/W difference in measured intelligence is entirely genetic in 
origin. The worst case has come to pass. What difference would this 
news make in the way that you approach the question of ethnic differ¬ 
ences in intelligence? Not someone eise but you. What has changed for 
the worse in knowing that the difference is genetic? Here are some hy- 
pothetical possibilities. 

Ij it were krtoum that the B/W difference is genetic , would 1 treat individ- 
ual blacks differemly from the way I would treat them if the differences were 
environmental? Prohably, human nature being what it is, some people 
would Interpret the news as a license for treating all whites as intellec- 
tually superior to all blacks. But we hope that putting this possibility 
down in words makes it obvious how illogical—besides utterly un- 
founded—such reactions would be. Many blacks would continue to be 
smarter than many whites. Ethnic differences would continue to be dif¬ 


ferences in means and distrihurions; rhey would continue to be useless, 
for all practical purposes, when assessing individuals. If you were an ern- 
ployer looking for intellectual talent, an IQ of 120 is an IQ of 120, 
wherher rhe face is black or white, let alone whether the mean differ¬ 
ence in ethnic groups were genetic or environmental. If you were a 
teacher looking at a elassroom of black and white iaces, you would have 
exactly the same Information you have now abour the probabilities that 
they would do well or poorly. 

If you were a government otficial in charge of educational expendi- 
tures and programs, you would continue to try to improve the educa- 
tion of inner-city blacks, partly out of a belief that everyone should he 
educated to the limits of his ahility, partly out of fairness io the indi- 
viduals of every degree of ability within that population—but also, let 
it be emphasized, out of a hardheaded calcularion that the net social 
and economic rerurn of a dollar spent on the elementary and secondary 
education of a Student does not depend on the heritability of a group 
difference in IQ. More generally: We cannot think of a legitimatc argument 
ivhy arrv enanmter between individual whites and blades need be affected by 
the knowledge that an aggregate ethnic difference in measured intelligence is 
genetic nistend of environmental. 

It is true that employers might under some cireumstanoes find it eco- 
nomically advantageous to use ethnicity as a crude hut inexpensive 
screen to cut down hiring costs (assuming ir were not illegal to do so). 
But this incentive exists already, by virtue of the existenee of a differ¬ 
ence in ohserved intelligence regardless of whether the difference is ge- 
neric. The existenee of the difference has many inrersecrions with policy 
issues. The source of rhe difference has none that we can think of, at 
least in the short rerm. Whether it does or not in the long term, we dis- 
cuss below. 

If the differences are genetic, arent they harder to change than if they are 
em’irunmerita/? Another common reaction, this one relies on false as- 
sumptions ahout intelligence. The underlying error is to assume rhat an 
environmentally caused deficit is somehow less hard-wired, that it has 
less impact on “real” capahilities, than does a genetically caused deficit. 
We have made this poinr before, but it hears repearing. Some kinds of 
environmentally induced conditions can be changed (lack of familiar- 
ity with television shows for a person wirhour a television sei will proh¬ 
ably he reduced hy purchasing him a television set), but there is no 
reason to think that intelligence is one of them. To preview a conclu- 
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sion vvc will documenr at length in Chapter 17, an individuals realized 
inrelligence, no matter whether realized through genes or the environ- 
ment, is not very malleahle. 

Changing cognitive ahility through environmental interventions has 
proved to he extraordinarily difficult. At best, the examples of special 
programs that have pennanently raised cognitive ahility are rare. Per- 
haps as time goes on we will leam so mach ahout the environmenr, or 
sc') mach about how intelligence develops, that eftective interventions 
can be designed. But this is only a hope. Until such advances in social 
interventions come about, which is unlikely to happen any time soon, 
it is essential to grasp the point made earlier in the book: A short per- 
son who could have been taller had he eaten hetter as a child is nonerhe- 
less real ly short. The corn planred in the Mojave Desert that could have 
flourished if it had been planted in Iowa, wasn’t planted in Iowa, and 
the res no way to rescue it when it reaches maturity. Saying that a dif¬ 
ferente is caused by the environment says nothing ahout how real it is. 

Aren't genetic dißerenccs [mssed down through the generations , while en¬ 
vironmental differences are not! Yes and no. Environmental ly caused char- 
acteristics are by detinition not heritabie in the narrow technical sense 
that they do not involve genetic transmission. But nongenetic charac- 
teristics can nonetheless run in families. For practical purposes, envi- 
ronments are heritabie too. The child who grows up in a punishing 
environment and therehy is intellectually stunted takes that deficit to 
the parenring of his children. The learning environment he encoun- 
rered and the learning environment he provides for his children rend 
to be similar. The correlation between parents and children is just that: 
a Statistical tendency for these things to be passed down, despite soci- 
etys attempts to change them, without any necessary genetic compo- 
nent. In trying to break these intergenerational links, even adoption at 
birth has its limits. Poor prenatal nutrition can stunt cognitive poten¬ 
tial in ways that cannor be remedied after birth. Prenatal drug and ab 
cohol abuse can stunt cognitive potential. These traits also run in 
families and communities and persist for generations, for reasons that 
have proved difficulr to affect. 

ln sum: If tomorrow you knew beyond a shadow oi a doubt that all 
the cognitive differences between races were 100 percent genetic in ori¬ 
gin, nothing of any significance should change. The knowledge wem Id 
give you no reason to treat individuals differently than if ethnic differ- 
ences were 100 percent environmental. By the same token, knowing 


that the differences are 100 percent environmental in origin would not 
suggesr a single program or policy that is not already being tried. It would 
justify no optimism about the time it will take to narrow the exisring 
gaps. It would not even justify confidence that genetieally hased differ¬ 
ences will not be upon us within a few generations. The impulse to think 
that environmental sources of difference are less threatening than 
genetic ones is natural but illusory. 

HOW ETHNIC DIFFERENCES FIT 1NTO THE STORY 

ln any case, you are not going to learn tomorrow that all the cognitive 
differences between races are 100 percent genetic in origin, because the 
scientific state of knowledge, unfinished as it is, already gives ample ev- 
idence that environment is part of the story. But the evidence eventu- 
ally may hecome unequivocal that genes are also part of the story. We 
are worried that the elite wisdom on this issue, for years almost hyster- 
ically in denial about rhat possihility, will snap too far in the other di- 
rection. It is possible to face all the facts on ethnic and race differences 
in intelligence and not run screaming from the room: That is the es¬ 
sential message. 

This chapter is also central to the larger themes of the book, which is 
why we ask readers who have started with Part 111 ro turn back to the In- 
troduction and hegin the long trek. ln Part I, we described the formation 
of a cognitive elite. üiven the cognitive differences among ethnic and 
racial groups, the cognitive elite cannot represent all groups equally, a 
Statement with implications rhat we will develop in Part IV. In Part II, 
we descrihed how intelligence is important for understanding the social 
Problems of our time. We limited the discussion to whites to make it eas- 
ier to think ahout the evidence without constantly having to worry 
about racism, cultural bias in the tests, or other extraneous issues. 

The material in this chapter lets us proceed. As far as anyone has 
heen able to determine, IQ scores on a proper ly administered test mean 
ahout the same rhing for all ethnic groups. A substantial difference in 
cognitive ahility distributions separates whites from hlacks, and a 
smaller one separates East Asians from whites. These differences play 
our in public and private life. In the rest of Part 111, wo may novv ex am- 
ine the relationship between social problems and IQ on a national scale. 


Chapter 14 


Ethnie Inequalities in 
Relation to IQ 


Ethnie differences in educanon, occupations, poverty , unemployment, ille- 
crime , and other signs of inequahty preoccupy scholars and thought- 
ful citizcns. ln this chapter , we examine the.se differences after cognitive ability 
is taken into account. 

We find that iMtinos and whites ofsimilar cognitive ability have similar so¬ 
cial behavior and economic outcomes. Some differences remain, and a fevu 
arc substanäal, but the overall pattem is similarity. For blacks and whites , the 
story is more complicated. On two vital indicators of success—educational 
attainment and entry into prestigious occupations—the black-white discrep- 
ancy reverses. After Controlling for IQ, larger numbers of blacks than whites 
graduate from College and enter the professions. On a third important indica- 
tor of success , wages , the black-white difference for year-round workers 
shrinks from several thousand to a few hundred dollars . 

In contrast, the B/W gap in annualfamily income or in persans belovu the 
poverty line narrows after Controlling for IQ but still remains sizable. Simi- 
larly, differences in unemployment, labor force participation, marriage, and 
ilkgitimacy get smaller but remain significant after extracting the effect of IQ . 
The.se inequalities must be explained by other factors in American life. Schol¬ 
ars have advanced nvany such explanations; we will not try to adjudicate 
among them here, except to suggest that in trying to understand the cultural, 
social, and economic sources of the.se differences, underStanding how cogni¬ 
tive ability plays into the mix of factors seems indispensable. The role of cog¬ 
nitive ability has seldom been considered in the past. Doing so in future 
research could clarify issues and focus attention on the factors that are actu- 
ally producing the more troubling inequalities . 
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A mericas pressing social problems are often portrayed in ethnic 
terms. Does the nation have an unemployment prohlem? Ir de- 
pends. Among whites in the recession year ot 1992, unemployment was 
ander seven percent, bnt it was fourteen percent among blacks. 1 
Poverty? The poverty rate in 1992 for whites was less than twelve per- 
cent but thirty-three percent for blacks. 2 Such numbers, and the debate 
over what they should mean for policy, have been at the center of Amer¬ 
ican social policy since the early 1960s. As Latinos have become a 1 arger 
portion of the population, the debate has hegun to include similar dis- 
parities berween Latinos and whites. 

Such disparities are indisputable. The question is why. Surely history 
plays a role. Open racism and institutional discrimination of less obvi- 
ous sorts have been an important part of the historical story for blacks 
and are relevant to the historical experience of Latinos and Asian- 
Americans as well. Cultural differences may also be involved. An eth¬ 
nic group with a strong Roman Catholic heritage, such as Latinos, may 
behave different ly regarding birth control and illegitimacy than one 
without that background. The tradition of filial respect in the Confu- 
cian countries may bear on the behavior of American teenagers ot East 
Asian ancestry when one looks at, for example, delinquency. 

Part 11 showed the impact of cognitive ability on poverty, illegitimacy, 
crime, and other social problems in America among whites. Chapter 1 3 
showed that the major ethnic groups in America differ, on the average, 
in cognitive ability. There is accordingly reason to ask what happens to 
ethnic differences in economic and social behavior when intelligence 
is held constant. This chapter examines that question. 

The NLSY, with its large samples of blacks and Latinos (though not 
Asians), permits us to address the question directly and in detail. We 
will show what happens to the ethnicgap on a variety of indicators when 
IQ is taken into account. To anticipate: ln some cases, large ethnic dif¬ 
ferences disappear altogether, or even reverse, with whites having the 
disadvantageous outcome compared to blacks and Latinos. ln other 
cases, substantial differences remain, even after the groups are equated 
not only for cognitive ability but for parental SES and education as well. 
We du not try to press the analysis further, to find the other reasons why 
groups may differ socially. The goal of this chapter is to broaden the 
search for answers after three decades during which scholars have ig- 
nored the contribution of IQ to ethnic differences in the main social 
outcomes of evervdav lite. 


First, we look at the indicators of success that were the focus of Part 
], then rhe indicators of problems that were the focus of Part 11. 

ETHNIC DIFFERENCES IN EDUCATION AL AND 
OCCUPATIONAL SUCCESS 

We heg in with what should be hailed as a great American success story. 
Ethnic differences in higher education, occupations, and wages are strik- 
ingly diminished after Controlling for IQ. Often they vanish. In this 
sense, America has equalized these central indicators ol social success. 

E ducatitmal At tarn ment 

The conventional view of ethnic differences in education holds that 
blacks and Latinos still lag far hehind, hased on comparisons of the per- 
centage of minorities who finish high school, enter College, and earn Col¬ 
lege degrees. Consider, for example, graduation from high school. As of 
1990, 84 percent of whites in the NLSY had gotten a high school 
diploma, compared to only 7 3 percent of blacks and 65 percent of Latinos, 
echoing national statistics. 1 * 1 But these percenrages are hased on every- 
U )dy, at all levelsof intelligence. What were the odds that a blae kor Latino 
with an IQ of 103—the average IQ of all high school graduates—conv 
pleted high school? The answer is that a youngster from either minority 
group had a higher prohahility of graduating from high school than a white, 
if all of them had lQs of 103; The odds were 93 percent and 91 percent for 
blacks and Latinos respectively, compared to 89 percent for whites. 141 

College has similarly opened up to blacks and Latinos. Once again, 
the raw differentials are large, ln national statistics or in the NLS\ sam¬ 
ple, whites are more than twice as likely to earn College degrees than ei¬ 
ther blacks or Latinos. H The average IQ of all College graduates was, 
however, ahout 1 14. What were the odds that a black or Latino with an 
IQ of 114 graduated from College? The figure helow shows the answers. 

All the graphics in this chapter follow the pattem of this one. Ehe 
top three bars show rhe prohahilities ot a particular outcome—College 
graduation in this case—by ethnic group in the NLSY, given the aver¬ 
age age of the sample, which was 29 as of the 1990 interview. In this fig¬ 
ure, the top three hars show that a white adult had a 27 percent chance 
of holding a bachelor’s degree, compared to the lower odds for blacks 
(1 1 percent) and Latinos (10 percent). The prohahilities were computed 
through a logistic regression analysis. 



320 The National Context 


Ethnie /nequaliries in Relation to IQ 321 


After Controlling for IQ, the probability of graduating from College 
is about the same for whites and Latinos, higher for blacks 


The probability of holding a bachelor’s degree 


For a person of average age (29) before coniwlling for IQ 
White 

Black 11% | 

Latino 10%| 


For a person of average age and average IQ 
for College graduates (114) 

White_ 50% 

Black 68% 

Latino 49% 
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The lower set of bars also presents the probabilities hy ethnic group, 
bvit with one big difference: Now, the equation used to compute the 
probability assumes that each of these young adults has a certain IQ 
level. In th is case, the computation assumes that everyhody has the av¬ 
erage IQ ot all College graduates in the NLSY— a little over 114. We 
find that a 29-year-old (in 1990) with an IQ of 114 had a 50 pereent 
chance of having graduated from College if white, 68 pereent if black, 
and 49 pereent if Latino. After taking IQ into account, blacks have a 
hetter record of earning College degrees than either whites or Latinos. 
We discuss this hlack advantage in Chapter 19, when we turn to the ef- 
feets of affirmative act ion. 

Occupational Status 

One of the positive findings about ethnic differences has heen that ed- 
ucation pays off in occupational Status for minorities roughly the same 
as it does for whites* This was reflected in the NLSY as well: Holding 
education constant, similar proportions of blacks, Latinos, and whites 
are found in the various occupational categories. 1/1 


To what extent does Controlling for IQ produce the same result? We 
know from Chapter 2 that occupations draw from different segments of 
the cognitive ability distribution. Physicians come from the upper part 
of the distribution, unskilied laborers from the lower part, and so forth. 
If one ethnic group has a lower average IQ than another ethnic group, 
this will he reflected in their occupations, other things equal. What 
would the occupational distributions of different ethnic groups he after 
taking cognitive ability into account? 

Sociologist Linda Gottfredson has examined this question for blacks 
and whites.'* If, for example, black and white males were reeruited with- 
out discrimination into careers as physicians ahove a cutofl of an IQ of 
112 (which she estimates is a fair approximation to rhe lower hound for 
the actual population of physicians), the difference in the qualifying 
population pools would place the black-white ratio at about .05—about 
one hlack doctor for every twenty white ones. According to census data, 
the actual per capita ratio of black to white male physicians was ahout 
. 3 in 1980, which is about six black doctors for every twenty white ones. 
Another example is secondary school teaching, for which a similar cal- 
culation implies one hlack high school teacher for every ten white ones. 
The actual per capita ratio in 1980 was instead about six black reach- 
ers for every ten white ones. ln both examples, there are ahour six times 
as many blacks in the occupation as there would he if selection by cog¬ 
nitive ability scores were strictly race blind. Gottfredson made these cal- 
culations for occupations spanningmosr of the ränge of skilIed johs, from 
physician and engineer at the top end to truck driver and meat cutter 
at the low end. She concluded that blacks are overrepresented in almost 
every occupation, but most of all for the high-status occupations like 
medicine, engineering, and teachingQ 1 

We confirm Gottfredson s conclusions with data from the NLSY hy 
going hack to the high-IQ occupations we discussed in Chapter 2: 
lawyevs, physicians, dentists, engineers, College teachers, accountants, 
architects, chemists, Computer scientists, mathematicians, natural sei- 
entists, and social scientists. Grouping all of these occupations together, 
what chance did whites, blacks, and Latinos in the NLSY have of en¬ 
tering them? The figure below shows the results. 

Before Controlling for IQ and using unrounded figures, whites were 
almost twice as likely to be in high-lQ occupations as blacks and more 
than half again as likely as Latinos.* 101 But after Controlling (or IQ, the 
picture reverses. The chance of entering a high-lQ occupation for a 
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After Controlling for IQ, blacks and Latinos have substantially 
higher probabilities than whites of being in a high-IQ occupation 



black with an IQ of 117 (which was the average IQ of all the people in 
these occupations in the NLSY sample) was over twice the proportion 
of whites with the same IQ. Latinos with an IQ of 117 had itiore than 
a 50 percent higher chance of entering a high-IQ occupation than 
whites with the same IQ. 1111 This phenomenon applies across a wide 
ränge of occupations, as discussed in more detail in Chapter 20. 

Wages 

We come now to what many people consider the true test of economic 
equality, dollar income. Two measures of income need to he separated 
because they speak to different issues. Wages provides a direct measure of 
how mach a person gets per unit of time spent on the job. Annualfamily 
income reflects many other factors as well, being affected hy mariral Sta¬ 
tus (does the family have two incomes?), nonwage income (from stock 
dividends to welfare), and the amount of time spent earning wages (did 
the person have a job for all fifty-two weeks of the year?)- We hegin with 
wages, the measure that most directly reflects the current workplace. 

As of 1989, white year-round workers (of average age) in the NLSY 
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sample (men and women) made an average of $6,378 more than blacks 
and $3,963 more than Latinos. 1121 The figure below shows what happens 
Controlling for intelligence, this time presenting the results for a year- 

After Controlling for IQ, ethnic wage differentials shrink 
from thousands to a few hundred dollars 

Annual wages for a year-round worker, 1989 


For a person of average age (29) before Controlling for IQ 



For a person of average age and average IQ (100) 
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round worker with an IQ of 100. The average black who worked year- 
round was making less than 77 percent of the wage of the average env 
ployed white. 1131 After Controlling for IQ, the average black made 98 
percent of the white wage. For Latinos, the ratio after Controlling for IQ 
was also 98 percent of the white wage. Another way to summarize the 
outcome is that 91 percent of the raw black-white differential in wages 
and 90 percent of the raw Latino-white differential disappear after Con¬ 
trolling for IQ. 

These results say that only minor earnings differences separate 
whites, blacks, and Latinos of equal IQ in the NLSY. 14 Because this find- 
ing is so far from what the public commentary assumes, we explore it 
further. We focus on the Situation facing blacks, because the black-white 
disparities have been at the center of the political debate. Parallel analy- 
ses for Latinos and whites generally showed smaller initial income dis¬ 
parities and similar patterns of convergence after Controlling for IQ. 
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Our finding that wage differentials nearly disappear may be a surprise 
especially in light of the familiär conclusion that wage disparities per- 
sist even for blacks and whites with the same education. For example, 
in the 1992 national data collected by the Bureau of the Census, me¬ 
dian earnings of year-round, fulhtime workers in 1992 were $41,005 for 
white male graduates with a hachelobs degree and only $31,001 for black 
males with the same degreeO Similar disparities occur all along the ed- 
ucational ränge. The same pattem is found in the NLSY data. Even uf- 
ter Controlling for education, hlacks in the NLSY still earned only 80 
percent of the white wage, which seems to make a prima facie case for 
persistent discrimination in the labor market. 

Blacks and whites who grow up in similar economic and social cir- 
cutnstances likewise continue to differ in their earning power as adults. 
This too is true of the NLSY data. Suppose we conrrol for three fac- 
tors—age, education, and socioeconomic Background—thar are gener- 
ally assumed to influence people’s wages. The result is that hlack wages 
are still only 84 percent of white wages, again suggescing continuing 
racial discrimination. 

And yet Controlling just for IQ, ignoring both education and socioe¬ 
conomic Background, raises the average black wage to 98 percent ot the 
white wage and reduces the dollar gap in annual earnings from wages 
for year-round workers to less than $600. A similar result is given as the 
hottom row in the following rahle, this time extracting as well the ef- 
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fects of different occupational distributions hetween whites and blacks. 
The rovvs above it show what happens when separate wages are com- 
put cd for different occupational groupings. 

The tuble contains a number of noteworthy particulars, but the most 
interesting result, which generalizes to every occupational category, is 
how little difference education makes. A common complaint about 
wages is that they arc artificially affected by credentialLsm. If credentials 
are important, then educational differences between blacks and whites 
should account for inuch of their income differences. The table, how- 
ever, shows that knowing the educational level of blacks and whites does 
little to explain the difference in their wages. Socioeconomic Back¬ 
ground also falls to explain much of the wage gaps in one occupation 
after another. Thal hrings us to the final column, in which IQs are con- 
rrolled while education and socioeconomic Background are left to vary 
as they will. The black-white income differences in most of the occu- 
pations shrink considerahly. Altogether, the table says that an IQ score 
is more important—in most cases, much more important—in cxplain- 
ing black-white wage differences than are education and socioeconomic 
hackground for every occupational category in it. 

Analyzing the results in detail would require much finer breakdowns 
than the ones presented in the table. Why is rhere still a meaningful dif¬ 
ferential in the managers/administrators category after Controlling for 
IQ? Why do blacks earn a large wage premium over whites of equiva¬ 
lent age and IQ in clerical and Service Jobs? The explanations could 
havesomething todo with ethnic factors, but the varietiesof jobs within 
diese categories are so wide that the differentials could reflect nothing 
more than different ethnic distributions in specific jobs (for example, 
ihe managers/administrators category include jobs as different as a top 
executive at GM and the shift manager of a McDonalds; the Service 
workers category includes both police and bushoys). We will not try to 
conduct those analyses, though we hupe others will. At the level rep- 
resented in the table, it looks as if the job market rewards blacks and 
whites of equivalent cognitive ability nearly equally in almost every job 
category. 

Although we do not attempt the many analyses that might enrich 
this Basic conclusion, one other factor—gender— is so obvious that we 
must mention it. When gender is added to the analysis, the black-white 
differences narrow by one or two additional percentage points for each 
of the comparisons. In the case of IQ, this means that the racial differ- 
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ence disappears altogether. Controlling for age, IQ, and gender (ignor- 
ing education and parental SES), the average wage for year-round black 
workers in the NLSY sample was 101 percent of the average white wage. 

Annual Income and Poverty 

We turn from wages to the broader question of annual family income. 
The overall family income of a 29-year-old in the NLSY (who was not 
still in school) was $41,558 for whites, compared to only $29,880 for 
blacks and $35,514 for Latinos. Controlling for cognitive ability shrinks 
the black-white difference in family income from $11,678 to $2,793, a 
notable reduction, but not as large as for the wages discussed ahove: 
black family income amounted to 93 percent of white family income af- 
ter Controlling for IQ. Meanwhile, mean Latino family income after 
Controlling for IQ was slightly higher than white income (101 percent 
of the white mean). The persisting gap in family income between blacks 
and whites is reflected in the poverty data, as the figure below shows. 
Controlling for IQ shrinks the difference between whites and other eth- 
nie groups substantially but not completely. 


Controlling for IQ cuts the poverty differential by 
77 percent for blacks and 74 percent for Latinos 


The probability of being in poverty 
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lf comme.ntators and public policy specialists were looking at a 6 per¬ 
cent poverty rate for whites against 11 percent for blacks—the rates for 
whites and blacks with IQs of 100 in the lower portion of the graphic— 
their conclusions might differ from what they are when they see the un- 
adjusted rates of 7 percent and 26 percent in the upper portion. At the 
least, the ethnic disparities would look less grave. But even after Con¬ 
trolling for IQ, the black poverty rate remains almost rwice as high as 
the white rate—still a significant difference. 1161 Why does this gap per- 
sist, like the gap in total family income, while the gaps in educational 
attainment, occupations, and wages did not? The search for an answer 
takes us successively further from the things that IQ can explain into 
ethnic differenc.es with less well understood rootsT 


ETHNIC DIFFERENCES ON IND1CATORS OF SOCIAL 
PROBLEMS 

Ethnic differenc.es in poverty persist, albeit somewhar reduced, after 
Controlling for IQ. Let us continue with some of the other signs of so- 
cial maladjustment that Part II assessed for whites alone, adding ethnic 
differences to the analysis. We will not try to cover each of the indica- 
tors in those eight chapters (Appendix 6 provides much of that detail), 
but it may be instructive to look at a few of the most important ones, 
seeing where IQ does, and does not, explain what is happening behind 
the scenes. 

Unemployment and Labor Force Participation 

Black unemployment has heen higher than white unemployment for as 
long as records have heen kept—more than twice as high in 1992, typ- 
ical of the last twenty years. 18 Once again the NLSY tracks with the na¬ 
tional statistics. Restricting the analysis to men who were not enrolled 
in school, 21 percent of blacks spent a month or more unemploycd in 
1989, more than twice the rate of whites (10 percent). The figure for 
Latinos was 14 percent. Controlling for cognitive ability reduces these 
percentages, but. differently for blacks and Latinos. The difference be- 
tween whites and Latinos disappears altogether, as the figure below 
shows; that between whites and blacks narrows but: eines not disappear. 
Black males with an IQ of 100 could expect a 15 percent: chance of be¬ 
ing unemploycd for a month or more as of 1989, compared with an 11 
percent chance for whites. Dropping out of the lahor force is similarly 
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After Controlling for IQ, the ethnic discrepancy in 
male unemployment shrinks by more than half for 
blacks and disappears for Latinos 

The prohability of being unemployed for a month or more 
For a person of average age (29) before controlling for IQ 
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related to IQ. Controlling for IQ shrinks the disparity between blacks 
and whites by 65 percent and the disparity between Latinos and whites 
by 73 percent. 1191 

Scholars are discussing many possible explanations of the poorer job 
outcomes for black males, some of which draw on the historical experi- 
ence of slavery, others on the nature of the urbanizing process follow- 
ing slavery, and still others on the structural shifts in the eeonotny in 
the 1970s, but ethnic differences in IQ are not often included among 
the possibilities. 20 Racism and other historical legacies may explain why 
controlling for IQ does not eliminate differences in unemployment and 
dropping out of the labor force, but, if so, we would be left with no ev- 
ident explanation of why such factors are not similarly impeding the 
equalization of education, occupational selection, or wages, once IQ is 
taken into account. With the facts in hand, we cannot distinguish be¬ 
tween the role of the usual historical factors that people discuss and the 
possibility of ethnic differences in whatever other personal Attributes 
besides IQ determine a person’s ability to do well in the job market. We 
do not know whether ethnic groups differ on the average in these other 
ways, let alone why they do so if they do. But to the extern that there 
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are such differences, controlling for IQ will not completely wash out the 
disparities in unemployment and labor force participation. We will not 
speculate further along these lines here. 

Marriage 

Historically, the black-white difference in marriage rates was small un- 
til the early 1960s and then widened. By 1991, only 38 percent of black 
women ages 15 to 44 were married, compared to 58 percent of white 
women. 1211 In using the NLSY, we will limit the analysis to people who 
had turned 30 by the time of the 1990 interview. Among this group, 78 
percent of whites had married before turning 30 compared to only 54 
percent of blacks. The white and Latino marriage rates were only a few 
percentage points apart. When we add cognitive ability to the picture, 
not much changes. According to the figure below, only 8 percent of the 
black-white gap disappears after controlling for IQ, leaving a black with 
an IQ of 100 with a 58 percent c.hance of having married by his or her 
thirtieth birthday, compared to a 79 percent chance for a white with the 
same IQ. 

The reasons for this large difference in black and white marriage have 
been the subject of tntense debate that continues as we write. One 

Controlling for IQ explains little of the large 
black-white difference in marriage rates 
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school of thoLight argues that structural unemployment has reduced the 
numher of marriageable men for black women, but a growing body of 
information indicates that neither a shortage of black males nur so- 
cioeconomic deprivation explains the bulk of the hlack-white disparity 
in marriage. 1221 As we have just demonsrrated, neither does IQ explain 
much. For reasons that are yet to be fully understood, black America 
has taken a markedly different stance toward marriage than white and 
Latino America. 

lllegitimacy 

A significant difterence between hlacks and whites in illegitimate births 
goes hack at least to the early part of this Century. As with marriage, 
however, the ethnic gap has changed in the last rhree decades. In 1960, 
24 percent of black children were illegitimate, compared to only 2 per- 
cent of white children—a huge proportional difference. But birth 
within marriage remained the norm for both races. By 1991, the ligures 
on illegitimate births were 68 percent of all births for hlacks compared 
to 39 percent for Latinos and 18 percent for noivLarino whites.The 
proportional difference had shrunk, but the widening numerical differ¬ 
ence between hlacks and whites had led to a Situation in which births 
within marriage were no longer the norm for hlacks, while they re¬ 
mained the norm (though a deteriorating one) for whites. 

The hlack-white dtspariry in the NLSY is consistent with the na¬ 
tional statistics (although somewhat lower than the latest figures, he- 
cause it encompasses births from the mid-1970s to 1990). As of the 1990 
interview wave, the probabilities that a child of an NLSY woman would 
be born out of wedlock (Controlling for age) were 62 percent ior hlacks, 
2 3 percent for Latinos, and 12 percent for non-Latino whites. As far as 
we are ahle to determine, this disparity cannot be explained away, no 
matter what variables are entered into the equation. The figure below 
shows the usual first Step, Controlling for cognitive ability. 

Controlling for IQ reduced the Latino-white difference hy 44 per- 
cent but the hlack-white difference by only 20 percent. Nor does it 
change much vvhen we add the other factors discussed in Chapter 8: 
socioeconomic Background, poverty, coming from a Broken ho me, or 
education. No matter how the data are sliced, black women in the N LSY 
(and in every other representative database that we know of) have a 
much higher proportion of children out of wedlock than either whites 
or Latinos. As we write, the debate over the ethnic disparity in illegit- 
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Controlling for IQ narrows the Latino-white difference in 
illegitimacy but leaves a large gap between blacks and whites 



For a mother of average age and average IQ f 100) 



i i i i i i i i 

0% 10% 20% 30% 40% 50% 60% 70% 


imacy remains as intense and as far from resolution as ever. 24 We can 
only add that ethnic differences in cognitive ability do not explain much 
of it either. 

Welfare 

As of 1991, about 21 percent of black women ages 15 to 44 were on 
AFDC nationwide, compared to 12 percent of Latino women and 4 per¬ 
cent of white women (including all women, mothers and nonmoth- 
ers). 2S The NLSY permits us to ask a related question that extends hack 
rhrough time: How many of the NLSY women, ages 26 to 3 3 as of 1990, 
had ever been on welfare? The answer is that 49 percent of black women 
and 30 percent of all Latino women had been on welfare at one time or 
another, compared to 13 percent of white women. 1201 The figure shows 
the effects of Controlling for IQ. 

Adding cognitive ability explains away much of the disparity in web 
fare recipiency among hlacks, whites, and Latinos. In the case of Lati- 
nos, where 84 percent of the difference disappears, the remaining 
disparity with whites is about three percentage points. The disparity be¬ 
tween blacks and whites—30 percent of black women receiving wel- 
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Controlling for IQ cuts the gap in black-white welfare 
rates by half and the Latino-white gap by 84 percent 

The probability that a woinan has ever been on welfare 
(all women, mothers and non-mothers) 

For a woman of average age (29) before Controlling for IQ 
Whites 13% I 


Blacks 

49% 

Latinos 

30% | 


For a woman of average age and average IQ { WO) 
Whites 12% | 


Blacks 

30% | 

Latinos 

15% | 
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fare, compared to about 12 percent for whites—is still large hut only half 
as large as the difference not adjusted for IQ. 

This is as much as we are able to explain avvay. When we probe für- 
ther, IQ does not do more to explain the black-white difference. For ex- 
ample, we know that poverty is a crucial factor in determining wherher 
women go on welfare. We therefore explored whether IQ could explain 
the black-white difference in a particular group ot women: rhose who 
had had children and had been below the poverty line in the year prior 
to birth. The results of the analysis are shown in the figure below. 
Among women who were poor in the year prior to birth, the black-white 
difference is slightly larger after Controlling for IQ, not smaller. These 
data, like those on illegitimacy and marriage, lend Support to the Sug¬ 
gestion that blacks differ from whites or Latinos in their likelihood of 
being on welfare for reasons that transcend both poverty and IQ, for rea- 
sons that are another subject of continuing debate in the literature. 1 ' 71 

L ow'Birth'Weight Babies 

Low birth weight, defined as infants weighing less than 5.5 pounds at 
birth, is predictive of many subsequent difficulties in the physical, so¬ 


Even among poor mothers, Controlling for IQ does not 
diminish the black-white disparity in welfare recipiency 


The probability that a poor mother has ever been on welfare 

For a poor mother of average age (29) before contra! ling for IQ 
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r of average age and average IQ (100) 
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cial, and cognitive development of children. Historically, blacks have 
had much higher rates of low birth weight than either Latinos or whites. 
ln the most ree ent reporting year (1991) tor national data, almost four- 
teen percent. of all black babies were low 7 birth weight, compared to five 
percent of white babies and six percent of Latino bahiesQ ln our analy- 
ses of the NLSY data, we focus on habies who were low hirth weight rel¬ 
ative to the length of gestation, excluding prematurc babies who were 
less than 5.5 pounds hut were appropriate for gestational age using the 
Standard pediatric definitionQ Using unrounded data, the rate of low- 
hirrh-weight hirths for blacks (10 percent) was 2.9 times as high as for 
whites. The Latino rate was 1.5 times the white rate. The figure shows 
whar happens after Controlling for IQ. The black rate, given an IQ of 
100, drops from 10 percent to 6 percent, substantially closing the gap 
with whites. 1 * 01 The Latino-white gap remains effectively unchanged. 

Children Living in Poverty 

ln 1992, 47 percent of black children under the age of 18 were living 
under the poverty linc. This extraordinarily high figure was nearly as 
had for Latino children, with 40 percent under the poverty line. For 
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Controlling for IQ cuts the black-white disparity 
in low-birth-weight babies by half 

The probability of giving birth to a low-birth-weight baby 
For a mother of average age (29) before Controlling for IQ 
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non-Latino whites, the proportion was about 14 percent. 1111 ln ap- 
proaching this issue through the NLSY, we concentrated on very young 
children, identifying those vvho had lived in families with incomes he- 
low the poverty line throughout their first three years of life. The re- 
sults, before and after Controlling for IQ, are shown in the upper figure 
on the next page. Given a mother with average IQ and average age, the 
probability that a black child in the NLSY lived in poverty throughout 
his first three years was only 14 percent, compared to an uncorrected 
black average of 54 percent. The reduction for Latinos, from 30 percent 
to 10 percent, was also large. The proportional difference hetween mi- 
norities and whites remains large. 12 

The Child’s Harne Environment 

We now turn to the measure of the home environment, the HOME in- 
dex, described in Chapter 10. For this and the several other indexes usecl 
in the assessment of NLSY children, we follow our practice in Chapter 
10, focusing on children at the bottom of each scale, with bottom op- 
erationally defined as being in the bottom 10 percent. 

The disparities in low HOME index scores between whites and 
minorities were large (see rhe lower figure on the next page). It was 
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Controlling for IQ reduces the discrepancy between minority and 
white children living in poverty by more than 80 percent 


The probability oi'a child living in poverty tor the tirsl three years 


Born to a mother average age (29) before controlhng for IQ 
Whites 9% | 


Latinos 30% 


Born to a mother of average age and average IQ f 100) 
Whites 6%| 


Blacks 14 % 


Latinos 
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Controlling for IQ cuts the ethnic disparity in home environments 
by half for blacks and more than 60 percent for Latinos 

The probability of being in the bottom dccile on the HOME index 

Born to u person of averafic a<;e (2V) before eontrolUn# for IQ 
Whites 7% I 


Latinos 


Born to a person of average age and average IQ {100) 
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substantially reduced, by 52 percent for blacks and 64 percent for Lati¬ 
nos, but the black rate remained well over twice the white rate and the 
Latino rate close to twice the white rate.' 1 

Indicators of the Child’s Development 

Details on the several indexes of child development presenred in Chap- 
ter 10 may he found in Appendix 6. We summarize them here hy show- 
ing the proportion of children who showed up in the bottom deale of 
any of the indexes. 

As the figure below shows, the ethnic disparities were not great even 
before Controlling for IQ, and they more than disappeared after Con¬ 
trolling for IQ. We leave this finding as it Stands, but it ohviously raises 

Controlling for IQ more than eliminates Overall 
ethnic differences in the developmental indexes 

The probability that a child was in the bottom decile of 
one or more of the developmental indexes 

Born to a mother of average age (29) before Controlling for IQ 
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a numher of issues. Since these indexes are based primarily on the moth- 
ers’ assessments, it is possihle that women of different ethnic groups use 
different reference points (as has heen found on ethnic differences in 
°ther self-report measures).* 4 It is also possihle that the results may be 
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taken at face value and that minority children with mothers of similar 
age and IQ do better on developmental measures than white children, 
which could have important implications. Pilling out this story lies be- 
yond the scope of our work, but we hope it will be taken up by others.' s 

Intellectual Development 

We will discuss this topic in more detail in Chapter 15 as we present the 
effects of differential fertility across ethnic groups. The figure below 
shows the children of NLSY mothers who scored in the bottom decile 
on the Peabody Picture Vocabulary Test (PPVT) hased on mxtional norms , 
not the bottom decile of children within the NLSY sample. Control- 

Based on national norms, high percentages of minority children 
remain in the bottom decile of IQ after controlling for the 
mother’s IQ 

The probability that a child is in the bottom decile of the PPVT 
(based on national norms) 

Born to a person of average age (29) before controlling for IQ 
Whitex 7%| 

Blacks 

Latinos 

Born to a person of average age and average IQ (WO) 
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Blacks 
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ling for the mothers IQ reduces ethnic disparities considerahly while 
once again leaving a broad gap with whites—in this case, roughly an 
cqual gap between whites and both blacks and Lat intos. The point that 
Stands out, however, is the extremely large proportion of minority NLSY 
children who were in the bottom decile of the PPVT—in effect, me an- 
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ing an IQ of 80 or lower—when national norms are applied. This is one 
of the reasons for concern about fertility that we discuss in Chapter 1 5. 

Crime 

In the national data, blacks are about 3.8 times more likoly to he ar- 
rested relative to their numbers in the general population than whites 
(Latino and non-Latino whites are comhined in this comparison). 11 
Blacks are also disproportionately the victiins of crime, especially vio- 
lent crime. The ratio of black homicide victiins to white as of 1990 was 
7.7 to 1 for men and 4.8 to 1 for women.’ 7 

Sociologist Robert Gordon has analyzed black-white differences in 
crime and concluded that virtually all of the difference in the preva- 
lence of black and white juvenile delinquents is explained by the IQ dif¬ 
ference, independent of the effect of socioeconomic Status.*’' The only 
reliahle indicator from the NLSY that lets us compare criminal behav- 
i°r across ethnic groups is the percentage of young men who were ever 
interviewed while incarcerated.' 1)| The figure below shows the Standard 
compaiison, hefore and after Controlling for cognitive ability. Among 
white men, the proportion interviewed in a correctional facility after 



Ethnic Inequalities in Relation to IQ 339 

Controlling for age was 2.4 percent; among black men, it was 13.1 per- 
cent. This large black-white difference was reduced by almost three- 
quarters when IQ was taken into account. The relationship of cognitive 
ability to criminal behavior among whites and blacks appears to be sim- 
ilar. 40 As in the case of other indicators, we are lett with a nontrivial 
black-white difference even after Controlling for IQ, but the magnitude 
of the difference shrinks dramatically. 

The Middle Class Values Index 

We concluded Part II with the Middle Class Values (MCV) Index, 
which scores a “yes” for those young adults in the NLSY who were still 
married to their first spouse, in the labor force if they were men, bear- 
ing their chiLiren within marriage if they were women, and staying out 
of jail, and scores a “no” for those who failed any of those criteria. Never- 
married people who met all the other critena were excluded. The MCV 
Index, as unsophisticated as it is, has a serious purpose: It captures a set 
of behaviors that together typify (though obviously donot define) “solid 
citizens.” Having nvany such citizens is important for the creation of 
peacetul and prosperous communities. The figure below shows what 

The MCV Index, before and after Controlling for IQ 

The probability of scoring “yes” on the 
Middle Class Values Index 

For a per non of average age (29) before con troll ing for IQ 
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happens when t he MCV Index is applied to different ethnic groups, first 
adjusting only for age and then Controlling for IQ as well. (In Inter¬ 
pret ing rhese data, bear in mind that large numhers of people of all eth- 
nicities who did not score “yes” are leading virtuous and productive 
lives.) The ethnic clisparities remain instructive. Refore Controlling for 
IQ, large clisparities separate both Latinos and hlacks from whites. ßut 
given average IQ, the Latino-white difterence shrank tu rhree percent- 
age points. The difterence hetween hlacks and whites and Latinos re- 
mains substantial, though only ahout halt as large as it was betöre 
Controlling tor IQ. This outcome is not surprising, given what we have 
already shown about ethnic. differences on the indicators that go into 
the MCV Index, hui it nonetheless points in a summary fashion to a 
continuing divergence hetween hlacks and the rest ot the American 
population in some basic social and economic behuviors. 

A MORE REALIST1C VIEW OF ETHNIC DISPARITIES IN 
SOCIAL AND ECONOMIC INDICATORS 

It one of Americas goals is to rid itself ot racism and institutional dis- 
criminatioiv, then we should welcome the finding that a Latino and 
white of similar cognirive abiliiy have the same chances ot getting a 
hachelor’s degree and working in a white-collar joh. A black with the 
same cognirive abiliiy has an even higher chance than either the Latino 
or white of having those good things happen. A Latino, black, and white 
of similar cognitive ability earn annual wages with in a few hundred dob 
lars ot one another. 

Some ethnic differences are not washed away by Controlling either 
tor intelligence or for any other variables that we examined. We leave 
those remaining differences unexplained and look torward to learning 
from our colleagues where the explanations lie. We urge only thai they 
explore those explanations after they have extracted the role—offen 
the large role—that cognitive ability plavs. 

Similarly, the evidence presented here should give everyone who 
writes and talks ahout ethnic inequalities reason to avoid flamboyant 
rhetoric ahout ethnic oppression. Racial and ethnic differences in this 
country are seen in a new light when cognitive ability is added to the 
picture. Awareness of these relationships is an essential first Step in try- 
ing to construct an equitable America. 


Chapter 15 

The Demography of Intelligence 


When people die , they are not replaced < me for one hy hahies who will develop 
idemical IQs . // the new hahies grow u[> to hmv systcmatically higher or lower 
IQs than che people who die, the national distrihution of intelligence changes. 
Mountm# evidence indicates that demographic trends are exerting dowmvard 
pressure on the distrihution oj cognitive ability in the United States and that 
the Inessures are strnng enough io have social consequences. 

Throughout the West, modernization has hrought falling hirth rates. The 
mtt’.s fall faster for educated women than the uneducated. Because education 
is so dasely linked with cognitive ability, this cends to produce a dysgenic ef- 
fect, or a dotemcard shift in the ability distrihution. Furthermore, education 
leails women to have their hahies later-—which ahme also pioduces additional 
dysgenic pressures. 

The Professional consensus is that the United States has experienced dys- 
genic pressures throughout either most of the Century (the optimists) or all of 
the Century (the pessimists). Women of all races and ethnic groups follow this 
pattem in similar fashion. There is some evidence that hlacks and Latinos are 
experiencing even more severe dysgenic pressures than whites, which could lead 
to further divergence hetween whites and other groups in future generations. 

The rulcs that currently govem Immigration (rrnvide the other majur source 
of dysgenic pressure. Itappears t hat the mean IQ of immigrants in the 1 980s 
works out to ahout 95. The low IQ may not he a jrrohlem; in the post, im- 
migrants have stmetimes shown large increases on such measures. But other 
evidence mdicates that the self-selection jrrocess that used to attract the clas¬ 
sic American Immigrant — brave , hard working, imaginative, self-starting, 
and often of high IQ—has heen changing, and with it the nature of some of 
the immigrant population. 

Putdng the pieces together, something worth worrying ahout is hnppening 
to the cognitive capital of the country. Improved health, education, and child- 
hood interventions may hide the denwgraphic effects, Hut that Joes not reduce 
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their importance. Whatever good things we can accomplish with changes in 
the environment would be that much more effective if they did not have to fight 
a demographic head wind. 


S o far, we have heen treating the distribution of intelligence as a fixed 
entity, But as the population replenishes itself from generation to 
generation by birth and Immigration, the people who pass from the 
seene are not going to he replaced, one for one, by other people with 
the same IQ scores. This is what we mean by the demography of intel- 
ligence. The question is not whether demographic processes in and of 
themselves can have an impact on the distribution of scores—that much 
is certain—but what and how big the impact is, compared to all the 
other forces pushing the distribution around. Mounting evidence indi- 
cates that demographic trends are exerting downward pressures on rhe 
distribution of cognitive ability in the United States and that the pres- 
sures are strong enough to have social consequences. 

We will refer to this downward pressure as dysgenesis, borrowing a 
term from population biology. However, it is important once again not 
to be sidetracked by the role of genes versus the role of environment. 
Children resemble their parents in IQ, for whatever reason, and im- 
migrants and their descendants may not duplicate the distribution of 
Americas resident cognitive ability distribution. lf women with low 
scores are reproducing more rapidly than women with high scores, the 
distribution of scores will, other things equal, decline, no matter 
whether rhe women with the low scores came by them through nature 
or nurture. 111 More generally, if population growth varies across the ränge 
of IQ scores, the next generation will have a different distribution of 
scores/ In trying to foresee changes in American life, what matters is 
how the distribution of intelligence is changing, more than udry. 

Our exploration of this issue will proceed in three stages. First, we 
will describe the state of knowledge about when and why dysgenesis oc- 
curs. Next, we will look at the present state of affairs regarding differ¬ 
ential birth rares, differential age of childbearing, and Immigration. 
Finally, we will summarize the shape of the future as best we can discern 
ir and describe the magnitude of the stakes involved. 


THE EVOLVING UNDERSTANDING OF DYSGENESIS 

The understanding of dysgenesis has heen a contesr between pessimists 
and optimists. For many decades when people first began to rhink sys- 
temarically about intelligence and reproduction in the late nineteenth 
Century, all was pessimism. The fertility rate in England began to fall in 
rhe 1870s, and it did not rake long for early students of demography to 
notice that fertility was declining most markedly at the upper levels of 
social Status, where the people were presumed to be smarter. 1 The larger 
families were turning up disproportionately in the lower classes. Darwin 
himself had nored that even within the lower classes, the smaller fam¬ 
ilies had the brighter, the more “prudent,” people in them. 

All that was needed to eonclude that this pattern of reproduction was 
had news for the genetie legacy was arithmetic, argued the British schol- 
ars around the turn of the twentieth Century who wanted to raise the 
intelligence of the population through a new Science that they called 
eugenics. 141 Their influence crossed the ocean to the United States, 
where the floud ol im migrants from Russia, eastern Hurope, and the 
Mediterranean raised a similar concern. Were those huddled masses 
hringing to nur shores a biological inheritance inconsistent with the 
American way of life? Some American eugenichts thought so, and they 
said as much to the Congress when it enacted rhe Immigration Act of 
1924, as we described in the Introduetion. 1 ^ 1 Then came scientific en- 
lightenment—the immigrants did not seem to be harming America’s 
genetie legacy a bit—followed by the terrors of nazism and its per Ver¬ 
sion of eugenics that effectively wiped the idea from public discourse in 
the West. But at bottom, the Victorian eugenicists and their successors 
had detected a demographic pattern that seeins to arise with great 
(though not universal) consistency around the world. 

For this story, let us turn first to a phenomenon about which there 
is no serious controversy, the demographic tramition. Throughout the 
world, the premodern period is characterized by a Balance between high 
death rates and high birth rates in which the population remains more 
or less constant. Then modernization hrings better hygiene, nutrition, 
and medicine, and death rates begin to fall. In the early phases of mod¬ 
ernization, birth rares rernain at their traditional levels, sustained by 
deeply embedded cultural and social traditions that encourage big tarn- 
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ilies, and population grows swiftly. But culture and tradition eventually 
give way to the attractions of small er families and the pracrical fact rhat 
when fewer children die, fewer children need ro be bom to achieve the 
same eventual state of affairs. Intrinsic birth rates begin to decline, and 
eventually the population reaches a slow- or no-growth state. 161 

The falling birth rate is a well known and widely studied feature of 
the deinographic. transition. What ts less well known, but seems to be 
true among Western cultures that have passed through the deinographic 
transition, is that declines in lifetime fertility occur disproporrionately 
among educated women and women of higher social Status (we will re- 
fer to such women as “privileged”), just as the Victorians thoughC 

Why? One reason is that privileged women lose their reproductive 
advantage. In premodern times, privileged young women were betrer 
nourished, better rested, and had better medical care fhan the unprivi- 
leged. They married earlier and suffered fewer marital disruptions. 6 The 
net result was that, on average, they ended up with rnore surviving chil¬ 
dren than did unprivileged women. As modernization proceeds, these 
advantages narrow. Another reason is that modern societies provide 
greater opportunities for privileged women to be something other than 
full-time mothers. Marriage and reproduction are often deferred for cd- 
ucation, for those women who have access to it. On the average, they 
spend rnore of their reproductive years in school because they do well 
in school, because their families support their schooling, or both. Neg¬ 
ative correlations between fertility and educational Status are likely to 
be the result. 

Even after the school years, motherhood imposes greater cosr in lost 
opportunities on a privileged woman than on an unprivileged one in 
the Contemporary West. 9 A child complicates having a career, and may 
nvake a career impossible. Ironically, even monetary costs work against 
motherhood among privileged women. By our definition, privileged 
women have rnore money than deprived women, but for the privileged 
woman, a child entails expenses that can strain even a high income— 
from child care for the infam to the cost of moving to an expensive sub- 
urb that has a good school System when the child gets older. In planning 
for a baby—and privileged women tend to plan their babies carefully— 
such costs are not considered optional but what must be spent to raise 
a child properly. The cost of children is one rnore reason that privileged 
women bear few children and postpone the ones they do bear, 10 

Meanwhile, children are likely to impose few opportunity costs on a 


very poor woman; a “career” is not usually seen as a realistic Option. 
Children continue to have the same attractions that have always led 
young women to find motherhood intriasically rewarding. And for 
women near the poverty line in most countries in the Contemporary 
West, a baby is either free or even profitable, depending on the specific 
terms of the welfare System in her country. 


The Demographie Transition Elsewhere 

The generalizarions in the text may be stated wirb confidence abour most 
comrrmniries in the West. Elsewhere, rhere is still much to be learned. 
Japan has passed through the demographic transition in that overall fer¬ 
tility bas dropped, but reproduction has not shifted as markedly toward the 
lower end of the scale of privilege as in the Western democracies. 11 The 
reason may he that in Japan, as in other East Asian societies, social oblig¬ 
at ions that encourage childhearing among the educated may rake prece- 
dence over the individualistic morives rhat might otherwise compete with 
parenrhood. Similar considerations may apply to Islamic communities as 
well, where the deinographic transition has heen weak. The Mormons öf¬ 
ter an American example of a weak demographic transition. 1 “ An account 
of the patterns of reproduction must eonsider cuhural, personal, religious, 
and familial factors, as weil as the rnore ohvious social variables, such as 
the rising leveis of educarion, women’s employmenr, and public health. 


Whatever the reasons and whatever the variations from commumty 
to community, the reality of the demographic transition in the modern 
West is indisponible and so, it would seem, is the implication. If repro- 
ductive rates are correlated with income and educational levels, which 
are themselves correlated with intelligente, people with lower intelli- 
gence would presumably he outreprodueing people with higher intelli¬ 
gente and rhereby producing a dysgenic effect. lM1 Can we find evidente 
that dysgenesis is actually Happening? 

The early studies from the United States, England, France, and 
üreece all seemed to confirm the reality of dysgenesisT ln the 1930s, 
the eminent psychometrician Raymond Catteil was predicting a loss of 
1.0 to 1.5 IQ points per decade, 16 while others were Publishingestimated 
losses of 2 to 4 IQ points per generation. 1 ' ln 1951, another scholar 
gloomily predicted that “if this trend continues for less than a Century, 
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England and America will he well on the way to hecoming narions of 
near half-wits.”^ The main source of their pessimism was that che av¬ 
erage IQ in large families was lower than in smaller families. 

Then came a period of optimism. lts harhinger was Frederick Oshorns 
Eugenic Hypothesis, first stated in 1940, whieh foresaw a eugenic effect 
arising from grearer equality of social and economic goods and wider 
availability of birth control. l|g * In the late 1940s, data hegan to come in 
that seemed to confirm this more sanguine view. Surveys in Scotland 
found that Scottish school children were getting higher IQs, not lower 
ones, despite the familiär negative relationship hetween family size and 
IQA Examining this and other new studies, Carteil reconsidered his Po¬ 
sition, concluding that past estimates might not have adequarely in- 
vestigated the relationship hetween intelligence and marriage rares, 
which could have skewed their results. 21 

The new optimism got a hoost in 1962 wich the publication of “In¬ 
telligence and Family Size: A Paradox Resolved,” in which the authors, 
usinga large Minnesota sample, showed how it was pussible tu have hoth 
a negative relationship hetween IQ and family size and, ar the same time, 
to find no dysgenic pattern for IQ.“ The people who had no children, 
and whose fertilities were thus omitted from the earlier statistics, the 
authors suggested, came disproportionately from the lower IQ portion 
of the population. From the early 1960s through 1980, a seri.es of stud¬ 
ies were published showing the same radically changed picture: slowly 
rising or almost stähle intelligence from generation to generation, de¬ 
spite the lower average IQs in the larger families. 12 ' 1 

The optimism proved to he ephemeral. As scholars examined new 
data and reexamined the original analyses, rhey found that the opti- 
mistic results tumed on factors that were ill understood or ignored at 
the time the studies were published. First, comparisons hetween suc- 
cessive generations tested with the same instrument (as in the Scottish 
studies) were contammated hy the Flynn effect, wherehy IQ scores 
(though not necessarily cognitive ability itself) rise secularly over time 
(see Chapter 13). Second, the samples used in the most-cited opti- 
mistic studies published in the 1960s and 1970s were unrepresentative 
of the national population. Most of them came from nearly all-white 
populations of States in the upper Midwest A Two of the important stud¬ 
ies published during this period were difficult to intcrpret because they 
were based not only on whites hut on males (estimating fertility among 
males poses numerous prohlems, and male fertility can he quite differ¬ 


ent from female fertility) and on samples that were restrieted to the up- 
per half of the ability distribution, thereby missing whar was going on 
in the lower half A 

Apart from these technical prohlems, however, another feature of the 
studies yielding optimistic results in the 1960s and 1970s limited their 
applicahility: They were based on the parents of the bahy boome rs, the 
children born hetween 1945 and ahout 1960. In 1982, demographer 
Daniel Vining, Jr., opened a new phase of the debate with the publica¬ 
tion of his cautiously titled article, “On the Possibi lity of the Reemer- 
gence of a Dysgenic Trend with Respect to Intelligence in American 
Fertility Differentials.”' 0 Vining presented data from rhe National Lon¬ 
gitudinal Survey cohorts selected in 1966 and 1968 (the predecessors of 
the much larger 1979 NLSY sample that we have used so extensive ly) 
supporting his hypothesis that people with higher intelligence tend to 
have fertility rares as high as or higher than anyone eise’s in periods of 
rising fertility hut that in periods of falling birth rares, they tend to have 
lower fertility rates. The American fertility rare had heen falling with- 
our a break since the late 1950s, as the bahy boom suhsided, and Vin¬ 
ing suspected that dysgenesis was again underway. 

Then two researchers from the University of Texas, Marian Van 
Court and Frank Rean, finding no evidence for any respite during the 
bahy boom in a nationally representative sample, determined that the 
childless members of the sample were not disproportionately low IQ at 
all; on the contrary, they had slightly higher IQs than people with chil¬ 
dren. Van Court and Bean coneluded that the United States had heen 

experiencing an unhroken dysgenic effect since the early years of the 

n 

Century. 

Since then, all the news has heen bad. Another study of the upper 
Midwest looked at the fertilities in the mid-1980s of a nearly all-white 
sample of people in Wisconsin who had heen high school seniors as of 
1957 and found a dysgenic effect corresponding to about 0.8 IQ point 
per generation.A 1991 study based on a wholly different approach and 
using the NLSY suggesrs that 0.8 per generation may he an underesri- 
mateA This study estimated the shifting ethnic makeup of the popula¬ 
tion, given the differing intrinsic birth rares of the various ethnic groups. 
Since the main ethnic groups differ in average IQ, a shift in Americas 
ethnic makeup implies a change in the overall average IQ. Even disre- 
garding rhe impact of differential fertility within ethnic groups, rhe 
shifting ethnic makeup by itself would lower the average American IQ 
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by 0.8 point per generation. Since rhe differential fertility vvithin those 
ethnic groups is lowering the average score for each group itself (as we 
show later in the chapter), the 0.8 estimate is a lower bound of the over- 
all population change. 

To summarize, there is still uncertainty about whether the United 
States experienced a brief eugenic interlude after World War II. Van 
Court and Bean conclude it has been all downhill since the early part 
of the twentieth Century; other researchers are unsure. 1E,0! There is also 
uncertainty deriving from the Flynn effect. James Flynn has by now con- 
vinced everyone that IQ scores rise over time, more or less everywhere 
they are studied, but there remains little agreement about what that 
means. For those who believe that the increase in scores represents am 
thentic gains in cognitive ability, the dysgenic effects may be largely 
swamped by Overall gains in the general environment. For those who 
believe that the increases in scores are primarily due to increased fest 
sophistication without affecting g, the Flynn effect is merely a statisri- 
cal complication that must be taken mto account whenever comparing 
IQ scores from different points in time or across different cultures. 

But within the scholarly Community, there is little doubt about dif¬ 
ferential fertility or about whether it is exerting downward pressure on 
cognitive ability. Further, the scholarly debate of the last fifty years has 
progressed: The margin of error has narrowed. Scientific progress has 
helped clarify the dysgenic effects without yer producing a precise cali- 
bration of exactly how much the distribution of cognitive ability is de- 
clining. This leads to our next topic, the current state of affairs. 

DYSGENIC PRESSURES IN AMERICA IN THE EARLY 1990S 

Foretelling the future about fertility is a hazardous Business, and fore- 
telling it in terms of IQ points per generation is more hazardous still. 
The unknowns are too many. Will the ranks of career women continue 
to expand ! Or might our granddaughters lead a revival of the traditional 
family? Flow will the environmental aspects of cognitive development 
change (judging from what has happened to SAT scores, it could be for 
worse as well as better)? 1111 Will the Fly nn effect continue? Even if it 
does, what does it mean? No one has any idea how these countervail- 
ing forces might play out. 

For all these reasons, we do not put much confidence in any specific 
predictions about what will happen to IQ scores decades from now. But 


we can say with considerable confidence what is happening right now, 
and the news is worrisome. |12J There are three major factors to take into 
account: the number of children born to women at various IQ levels, the 
age at which they have them, and the cognitive ability of immigrants. 

Cogrutme Ability and Number of Children 

Demographers often take a lifetime fertility of ahout 2.1 births as the di- 
viding line between having enough children to replenish the parent 
generation and having too few. lH * Bear that in mind while examining 
the figure below showing the “completed fertility”—all the babies they 
have ever had—of American women who had virtually completed their 
childhearing years in 1992, broken down by their educational attain- 

The higher the education, the fewer the babies 


Average number of children ever born 
to women ages 35^44 in 1992 

3- 



Less than High Some Associate Bachelor’s MA or 
high school College degree degree higher 

school 

Highest educational attainment 

Source: Bachu 199$,Table 2. 

ment. Overall, College graduat.es had 1.56 children, one child less than 
the average for women without a high school diploma. Let us consider 
the ratio of the two fertilities as a rough index of the degree to which 
fertility is tipped one way or the other with regard to education. A ra¬ 
tio greater than 1.0 says the tip is toward the lower educational levels. 
The actual ratio is 1*71, which can be read as 71 percent more births 
among high school dropouts than among women who graduated from 
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College. At least since the 1950s, the ratio in the United States has been 
between 1.5 and 1.85. h4] 

What does this mean for IQ? We may compute an estimate by using 
what we know about the mean IQs of the NLSY women who reached 
various levels of education. Overall, these most rec.ent data on Ameri¬ 
can fertility (based on women ages 35 to 44 in 1992, when the survey 
was taken) implies that the overall average IQ of American mothers was 
a little less than 98.'^ This is consistent with the analyses of American 
fertility that suggest a decline of at least 0.8 poinr per generation. 

This estimate is strengthened by using an altogether different slice of 
the national picture, based on the birth statistics for virtually all babies 
born in the United States in a given year, using the data compiled in 
Vital Statistics by the National Center for Health Statistics (NCHS). 
The most recent data available as we write, for 1991, provide modestly 
good news: The proportions of children born to better-educated 
women—-and therefore higher-IQ women, on average-—have been go- 
ing up in the last decade. The proportion of babies born to women with 
sixteen or more years of school (usually indicating a College degree or 
better) rose from 4.8 percent in 1982 to 5.9 percent in 1991. The pro¬ 
portion of babies born to wo men with something more than a high 
school diploma rose from 34-2 percent to 38.2 percent—small changes 
but in the right direction. The bad news is that the proportion of chil¬ 
dren born to women with less than a high school education has risen 
slightly over the last decade, from 22 percent to 24 percent, attrihutable 
to an especially steep rise among white women since 1986. 

In trying to use the educational Information in Vital Statistics ro esti- 
mate the mean IQ of mothers in 1991, it is essential to anticipate the 
eventual educational attainment of women who had babies while they 
were still of school age. After doing so, asdescribed in the note, |ir>1 the es- 
timared average IQ of women who gave birth in 1991 was 98. Consider- 
ing that census data and the Vital Statistics data come from different 
sources and take two different slices of the picture, the similarities are re- 
markablc. The conclusion in both cases is that differential fertility is ex- 
erting downward pressure on IQ. At the end of the chapter, wc show how 
much impact changes of this size may have on American society. 

What of evidence about dysgenesis in the NLSY itself? As of 1990, 
the women of the NLSY, ages 25 to 33, still had many childbearing years 
ahead. Presumably the new hirths will be weighted toward more highly 
educated women with higher IQs. Therefore the current mean IQ of the 


mothers of the NLSY children will rise. Currently, however, it Stands at 
less than 96 h7! 

Cognitive A bility and Mothers Age 

Population growth depends not just on the total number of children 
women have but on how old they are when they have them. The effect 
is dysgenic when a low-IQ group has babies at a younger age than a high- 
IQ group, even if the total number of children born in each group even- 
tually is the same. Because this conclusion may not be intuitively 
obvious, think of a simplified example. Suppose that over several gen- 
erations Croup A and Group R average exactly the same number of chil¬ 
dren, but all the women in Group A always have their babies on their 
twentieth birthday and all the women in Group R have their children 
on their rhirtieth birthday. The women in group A will produce three 
generations of children to every two produeed by Group B. Something 
like this has been Happening in the United States, as women of lower 
inte Ui ge nee have babies younger than wo men of higher intelligence. 
The NLSY once again hecomes the best source, because it provides age 
and education along with IQ scores. 

The oldest women in the NLSY had reached the age of 33 in 1990, 
by which time the great majority of first hirths have taken place.We 
can thus get a good idea of how age at first birth or average age at all 
hirths varies with cognitive ability, recognizing that a small minority of 
women, mostly highly educated and at the upper portion of the IQ dis- 
rriburion, will eventually nudge those results slightly. 1 ^ 1 We will not try 
to compensate for these missing data, because the hrunt ot our argutnent 
is that the timtng of births has a dysgenic effect. The hiases in the data, 
reported in the table below for women who were 30 or older, tend to 
understate the true magnitude of age differences by lQ. f4vl 

The average age at first birth was a few months past the 23d birth¬ 
day. This varied widely, however, by cognitive dass. Combining all the 
ethnic groups in the NLSY, women in the bortom 5 percent of intelli- 
gence have their first bahy more than seven years younger than women 
in the top 5 percent. When these figures are computed for the average 
age for all births (not just the first birth, as in the table), women in the 
hottom 5 percent have their babies (or all ot the ones they have had by 
their early thirties) at an average of five and a half years earlier. This 
gap will grow, not shrink, as the NLSY women complete their child- 
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Age 

at Childbearing 

Cognitive Class 

Mean Age at First Birth 

1 Very bright 

27.2 

11 Bright 

25.5 

III Normal 

23.4 

IV Dull 

21.0 

V Very dull 

19.8 

Overall average 

23.1 


bearing years. Even using the current figures, wonien in the bottom 5 
percent of the IQ distribution will have about five generations for every 
tour generations of the top 5 percent. A large and oben ignored dys- 
genic pressure from differences in age at hirrh is at work. 

ETHNIC DIFFERENCES IN FERTILITY 

Whatever the ethnic differences in cognitive ahility are now, they may 
change if ethnic groups differ in the extent to which their fertilities are 
dysgenic or not. In the long run, the vector of demographic trends in 
Intelligence—con verging or di verging across ethnic groups—could pro- 
foundly affect Americas future. 

Fertility Rates bv Etfmicipy 

In the 1992 analysis of American fertility using the Current Population 
Survey (CPS) to which we referred for a national estimate of dysgene- 
sis, women ages 35 to 44 had given birth to an average of 1.94 children: 
1.89 for white women, 2.23 for black women, and 2.47 for Latino 
women. 1 * 1 Similar or larger ethnic differences have characterized fertil¬ 
ity data for as long as such data have heen available, and they have led 
to a widespread belief that something in black and Latino culrure leads 
them to have larger numbers of children than whites do. We do not dis- 
pute that culture can influence family size—the Catholic tradition 
among Latinos may foster high Overall birth rates, for example—but the 
trends for the three groups are similar once the role of educahonal level 
is held constant. Consider the figure below, hased on the 1992 CPS study 
of fertility, again using women in the 55 to 44 age group who have nearly 
completed their childbearing years. 

This figure represents almost total lifetime fertilities, and it teils a 
simple story. In all three groups of women, more education means lower 


Fertility falls as educational level rises in similar fashion for black, 
white, and Latino women 


Average number of children ever born 
to women ages 35-44 in 1992 



1 - 

Less than High Some Assoeiate Bachclor’s MA or 

high school College degree degree higher 

schooi 

Highest educational attainment 

Source: Bachu 1W, Table 2. 


fertility. The two minority groups have higher overall fertility, but not 
hy much when education is taken into accounr. Given the known re- 
lationship between IQ and educational attainment, fertility is also 
falling wiLh rising IQ for each ethnic group. Indeed, if one tries to look 
into this relationship hy assigning IQ equivalents based on the rela- 
tionship of educational attainment and cognitive ahility in the NLSY, 
it appears that: after equating for IQ, black women at a given IQ level 
may have lower fertility rates than either white or Latino womcn. 1 ^ 1 
May we then conclude that whites, hlacks, and Latinos are on a 
downhill slope together, neither converging nur diverging in IQ l No, 
for two reasons. The first is that each ethnic group has different pro- 
portions of women at different IQ levels. For example, black women 
with IQs of 90 and below prohably have a fertility rate no higher than 
that of white women with the same IQs. But even so, only 15 percent 
of white women in the NLSY fall in the 90-and-below ränge, compared 
with 52 percent of black women. The relatively higher fertility rates of 
women with low IQs therefore have a larger impact on the black pop- 
ulation as a whole than on the white. Even if two ethnic groups have 
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equal birth rates at a given IQ, one group may have a larger Proportion 
of its bahies than the other at that IQ. This is illustrated by the next 
table, which uses the NLSY to see what the next generation looks like 
so far, when rhe women of the NLSY had reached the ages of 25 ro 3 3. 


The Next Generation So Far, for 
Three Ethnie Groups in the NLSY 


As of 1990, the Percentage of 
Children Born to Women with: 



IQs Less 

IQs Higher 


than 90 

than 110 

Whites 

19 

22 

Blacks 

69 

2 

Larinos 

64 

2 

National populärion 

33 

15 


Gedding whether the discrepancy between whites and both blacks 
and Latinos implies an increasing gap in cognitive ability would require 
extensive modeling involving many assumptions. On the face of it, the 
discrepancies are so dramatically large rhat the probability ot Umher 
divergence seems substantial. Furthermore, insotar as whites have the 
highest proportion of collcge-educated women who are delaying child- 
birth, the gap between whites and the other minoriries is more likely to 


Delayed Childbearing Across Ethnie Groups 


The ages of rhe women in the NLSY ranged from 25 to 3 3 as ot our last 
observation of them, meaning that more children remain to be horn, a dis- 
proportionate numher of whom will be born to women at the higher lev- 
eis of cognirive ability. This prevenred us from using the NLSY to make 
any estiinate of the overall dysgenic effect. But the remaining ehildbear- 
ing years are less of a problem when comparing differenrials among ethnic 
groups. The evidence suggests that better-educated women of all ethnic 
groups postpone childbearing, to similar degreesA Based on this experi- 
ence, the differenrials as they exist among ethnic groups in the 25-3 3 age 
cohort will probahly remain about the same through the rest of rhe NLSY 
womens childbearing years, rhough rhe means for each group will proha- 
hly rise somewhat. lnsofar as an artifact exists, it presumably acts to uiv 
derstate the eventual mean for whites, since whites have rhe largest 
proportion of women with College and advanced degrees, and rherefore 
presumably the largest group of high-IQ women delaying childbirth. 


increase than to diminish as the NLSY women completc their child¬ 
bearing years. 

Age at Birth by Ethnicity 

The second potential source of divergence between ethnic groups lies 
in the ages at which women are having their children. For NLSY moth- 
ers, the average ages when they gave birth as of 1990 (when they were 
ages 25 to 33) were 24.3 for whites, 23.2 for Latinos, and 22.3 for blacks. 
Once again, these gaps may be expected ro increase as the NLSY women 
complete their childbearing years. lf these age differentials persist over 
time (and they have heen found for as long as the statistics for rhe dif¬ 
ferent groups have been available), they will produce increasing diver- 
gence in the mean cognitive ability of successive generations for the 
three groups. Evidence from other sources confirms the NLSY, finding 
an increasing gap between white and non white (primarily black) 
women in when their reproductive lives begin, and also in their likeli- 
hood of remaining childless. 44 

Mothers and Children in the NLSY 

As we leave this topic, we may see how these various forces have played 
out so far in the successive generations of the NLSY. The NLSY has 
been testing the children of its original suhjects, which should eventu- 
ally provide one of the deanest estimates of dysgenic trends within eth¬ 
nic groups. The version of an IQ measure that the NLSY uses is the 
Peabody Picture Vocabulary Test (PPVT), a highly reliable, g-loaded 
test that does not require that the child be able to read. It was normed 
in 1979 with a national sample of 4,200 children to a mean of 100 and 
a Standard deviation of 15. 

lf we take the NLSY results at face value, American intelligence is 
plunging. The mean of the entire sample of NLSY children tested in 
1986 and 1988 is only 92, more than half a Standard deviation helow 
the national mean. We cannot take these results at face value, however. 
The NLSY’s sampling weights make the results “representative of the 
children of a nationally representative sample of women” who were of 
certain age ranges in the years the tests were given—which is subtly but 
importantly different from being a representative sample of American 
children. 45 But although it is not possible to interpret the overall chil- 
dren’s mean with any confidence, it is possible to compare the children 
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of women in different ethnic groups. The results tor children at least six 
years old and their mothers, shown in the table below, indicate that the 
gap between the children is karger than the gap separating the mothers, 


Ethnic Differences in Test Scores 

in Two Generations 

Ethnic 

Gap Separating 
the Mothers 

Gap Separating 
the Children 

Comparison 

in IQ Points 

in IQ Points 

White-black 

n .2 

17.5 

White-Latino 

12.2 

14.1 


by more than 4 points in the case of blacks and whites, hy almost two 
points in the case of whites and Latinos. There are technical reasons to 
hedge on any more specific Interpretation of these data. 1401 We may at 
least say that the results point in a worrisome direction. 

Pulling these different views of the Situation together, the data reveal 
demographic pressures for further ethnic divergence in IQ. We will not 
hazard a guess about the magnitude of ethnic divergence or its speed. 
Within another decade, assuming that the NLSY continues its testing 
program, guesses will not be necessary. When large numbers of the 
NLSY women approach the end of their childbearing years and their 
children have been tested after reaching an age when IQ scores are stä¬ 
hle, we not only will be able to answer whether and how much erb nie 
groups diverged for that generation of Americans but be able to pin 
down answers to many of the other questions about dysgenic effects na- 
tionwide. 

IMMIGRATION 

Immigration is an even older American trip wire for impassioned de- 
bäte than differential fertility, and the disputes continue to the present 
day. 4 ^ The reason is not hard to find: America has more people flowing 
into it than any other country. About half of the world’s migrants re- 
settling in new countries are coming to America as we write. 50 The peo- 
ple already living here have always viewed this influx of newcomers with 


Regression to the Mean to the Rescue? 


Th ose who dismiss the import.ance of dysgenic trends have mistakenly 
latched onto the Statistical phenomenon known as regression to the mean 
as a magic cure-all. The editorial page of the Neu> York Times, no less, is on 
record with an assurance to its readers that because of regression to the 
mean, eaeh successive generation of children of helow-average ]Q women 
will get closer to the average and rherefdre black and white scores will tend 
to converge. 4/ Alas, it doesn’t work that way. The results on the PPVT pro- 
vide a concrete illustrarion. 

Suppose rhat we recaleulate the gap between the three ethnic groups in 
two successive generations, this time expressing them in terms of Standard 
Jeviations based on the mothers’ and child re ns’ own Standard deviations, 
not on their place within rhe national distribution (as in the preceding 
table). 


Regression to the Mean and Ethnic Differences 
in Test Scores in Two Generations 


Ethnic 

Comparison 

White-hlack 

White-Latino 


Gap Separating 
the Mothers 
in SDs 
1.17 
1.05 


Gap Separating 
the Children 
in SDs 
1.17 
.95 


Üalculated in this way and shown in rhe table ahove, the gap between 
white and Latino children has shrunk somewhat compared to the gap 
separating their mothers. The gap between white and black children has 
at least grown no largerT Why can we obtain this result and still show a 
growing gap in IQ points between the ethnic groups? The answer is that 
“mean” referred to in “regression to the mean” is the population’s own mean. 
White children of dull white women will, on average, he eloser ro the mean 
for whites in their generation than their mothers were in their generation. 
A parallel Statement applies to black children of dull black women. Rat 
this does not necessarily imply that the IQ scores of black and white 
children must be eloser to each other than their mothers’ IQ scores were. 
It is a slippery eoncept. Some people find it is hei pfui to remember rhar 
regression to the mean works both ways: If you statt with a population of 
dull children and rhen find the IQs of their parents, you will find that 
rhe parents were eloser to the mean (on average) than their children. 
Regression to the mean is a Statistical phenomenon, not a hiological 
one. 
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complicated reactions ranging from pride to alarm. John Highatn and 
others have traced the crests and troughs of nativism and xenophobia, 
often laced with open racism, in our historyd 1 

Recently the debate over Immigration has intensified, as the [arge iiv 
flux of immigrants in the 1980s, legal and illegal, has reopencd all the 
old arguments. These who favor open immigration policies point to the 
adaptability of earlier immigrant populations and their contribution to 
America $ greatness, and remind us that the dire warnings of earlier anti- 
immigrationists were usually unfoundedQ Antnimmigrationists instead 
emphasize the concentration within some immigrant groups of people 
who commit crimes, fail to work, drop out of school, and go on public 
assistance. They see Umits in the American capacity for assimilating 
people from alien cultures and for tinding productive work for them. 1 * 

lt seems apparent that there are costs and benefits to any immigra' 
tion policy and that no extreme view, pro or con, is likely to he correct. 
Beyond that truism, it is apparent that the normative “American” will 
und ergo at least as large a change in the twenty'first Century as he has 
since the original Settlement. The nearly 100 percent of immigrants 
from northern and wesrern Europe in the original Settlement gave way 
to increasing fractions from Africa and from Southern and eastern Eie 
rope throughout the nineteenth Century, thence to a large majority from 
Asia and Latin America today. America was remade several times over 
by its immigrants before, and we trust the process will continue. By 2080, 
according to a typical estimate, Americas population will be less than 
50 percent non-Latino white, 15 percent black, 25 percent Latino, and 
over 10 percent Asian and other. h4J Multiculturalism of some sort is een 
tain. Whether it will be a functioning multiculturalism or an unraveb 
ing one is the main question about Immigration, and not one we can 
answer. 

Our first objective is simply to bring to people s attention that the 
question is important. Legal immigration in the 1980s contributed 29 
percent of the United States’ net population increase, much more than 
at any earlier period in the postwar era. 1 ’’ 51 If illegal immigration could 
be included, the figure would be significantly higher. Immigration does 
indeed make a difference to thefuture of the national distribution of in' 
telligence. It may not make as much difference as births in terms of raw 
numbers, but there is also this consideration: Whereas policy can have 
only long-term effects on the cognitive distribution of births, it can have 
large immediate effects on the nature of the immigrant population. 


There are few, if any, other domains where public policy could so di' 
rectly mold the cognitive shape of things to come. Meanwhile, the na~ 
tion’s political ground rules have yet. to accept that the intelligen.ee of 
immigrants is a legitimate topic for policymakers to think about. 

Ethnieity and IQ as They Apply to Immigration 

In trying to estimate an envelope of what the effects on the cognitive 
distribution might be, a useful first step is to assume that immigrants to 
the United States have the mean IQ that has generally been tound 
among persons of that ethnic group, then apply those numbers to the 
actual distribution of immigrants by ethnicity. Keeping in mind that we 
are hoping to do no more than establish a ränge of possibilities, we will 
begin by following Richard Lynn’s computations based on a review of 
the international data and assign means of 105 to East Asians, 91 to Pa' 
cific populations, 84 to blacks, and 100 to vvhites. ö We assign 91 to Lato 
nos. We know of no data for Middle East or South Asian populations 
that permit even a rough estimate. They and an unclassifiable “other” 
component in the immigration statistics constitute about 11 percent of 
immigrants and are omitted from the analysis. The ethnic ancestry of 
legal immigrants in the 1980s breaks down as follows: 1 


Latino 41% 

East and Southeast Asian 21% 

NoivLatino white 11% 

Black 9% 

Filipino 7% 

Middle East, South Asian, other 11% 


Applying the assigned IQ means to this breakdown, the mean IQ of 
immigrants in the 1980s works out to about 95—essentially unchanged 
from the 1960s and the 1970s (when the same procedure yields estb 
mates of 96 and 95 respectively). As the proportion of noivLatino 
whites dropped from 46 percent of immigrants in the 1960s to 11 per- 
cent in the 1990s, the percentage of East and Southeast Asians rose from 
6 percent to 21 percent, two counterbalancing trends regarding IQ. 

Modifying the estimates of ethnic lQs does not make much differ- 
ence. Some would argue that the East Asian mean is too high. Suppose 
we drop it to 100. Some would argue that the Latino mean is too low. 
Suppose we increase it to 94- We could shitt the black estimate up or 
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down by large amounts wirhout affecting die overall mean very far. Fid- 
dling with the numbers moves the overall estimated mean by only abour 
a point or two for defensible sets of values. The basic Statement is that 
about 57 percent of legal immigrants in the 1980s came from ethnic 
groups that have scores significantly below the white average, and in 
consequence the IQ mean for all immigrants is likely to be below 100. 

How about the idea that people who are willing to pack up and move 
to a stränge place in search of a hetter life are selTselected for desirable 
qualities such as initiative, determination, energy, and perhaps inrelli- 
gence as well? Given this plausible expectation, why not assutne that 
the mean for immigrants is significantly higher than average for their 
ethnic groups? Here, the NLSY provides a snapshot of the effects on the 
distribution of intelligence of the people coming across our borders, in- 
sofar as we may compare the IQs of those who were hom abroad with 
those who were born in the United States. 

Overall, the IQ of NLSY members who were born abroad was .4 Stan¬ 
dard deviation lower than the mean of those who were born in the 
United States, putting the average immigrant for this cohort at about 
the Mth centile of the native-born population. A breakdown of these 
results by ethnic groups reveals that different groups are making differ¬ 
ent contrihutions to this result. White immigrants have scores that put 
them a bit ahove the mean for the native-born American population 
(though somewhat lower than the mean for native-born American 
whites). Foreign-horn blacks score about five IQ poinrs higher than na- 
rive-bom blacks, for reasons we do not know. Latino immigrants have 
mean scores more than seven points lower than native-born Larinos and 
more than a Standard deviation below the overall national native-born 
mean. The NLSY gives no Information on the large immigrant popula¬ 
tion from the counrries of East Asia and Vietnam, who might he signif¬ 
icantly boosting the immigrant mean. 

Even considered simply as cognitive test scores, these results must he 
interpreted very cautiously. Immigrants typically earn higher scores on 
tests as they become acculturated, even on tests dcsigned to be “culture 
fair.’ n> The extremely large gap between native-born and foreign-bom 
Latino students seems likely to reflect additional effects of poor English. 
We do not know if this rise with accukuration is enough to counter- 
halancc the overall .4 Standard deviation disadvantage of a sample born 
elsewhere. Nonetheless, keeping all of these qualifications in mind, the 
kerne 1 of evidence that must also be acknowledged is that Latino and 


black immigrants are, at least in the short run, putting some down ward 
pressure on the distribution of intelligence. 

Self-Selection Fast and Present 

Many readers will find these results counterintuitive—the concept of 
the high-achieving immigrant is deeply ingrained in Americans’ view 
of our country—hut a few moments reflection, plus some additional 
data, may make the results more understandable. |5M| 

Think back to the immigrant at the turn of the Century. America was 
the Land of Opportunity—but that was all. There were no guarantees, 
no safety nets. One way or another, an immigrant had to make it on his 
own, Add to that the wrench of tearing himself and family away from a 
place where his people might have lived for centuries, the terrors of hav- 
ing to learn a new language and culture, often the prospect of working 
at jobs he had never tried before, a dozen orher reasons for apprehen- 
sion, and the United States had going for it a craekerjack self-selection 
mechanism for attracting immigrants who were brave, hard-working, 
imaginative, self-starting—and probably smart. Immigration can still 
select for those qualities, but it does not have to. Someone who comes 
he re because his cousin offers bim a job, a free airplane ticket, and a 
place to stay is not necessarily self-selected for those qualities. On the 
contrary, immigrating to America can be for that person a much easier 
Option than staying where he is. 

Economists have made considerable progress in understanding how 
the different types of immigration (and all the ones in between) have 
played out in practice. To begin wirh, it has been demonstrated beyond 
much douht that immigrants as a whole have more steeply rising earn- 
ings than American natives of equal age and measured skills and that, 
after a relatively short adaptation period of ten to fifteen years, immi¬ 
grants of equal age and education earn as much as natives. 00 Here is em- 
pirical Support for the proposition that immigrants taken as a whole are 
indeed self-selected for qualities that lead to economic success, and one 
might expect cognitive ability to be among them. 

But the experience of different immigrants at different times has var- 
ied drastically. Economist George Rorjas has systematized the conditions 
under which immigrants will be self-selected from the upper and lower 
tails of the ahility distribution. Suppose, he says, that you are living in 
a foreign country, considering whether to emigrate to America. Pre- 
sumably a major consideration is your potential income in the Unired 



362 The National Context 


States versus your income if you stay put. Borjas proposes that this caL 
culation interacts wich a persons earning potential. It makes sense for 
high-ability people to emigrate when they can reasonably think that 
they are heing underrewarded in their home country, relative ro their 
ability, and that the United States rewards the same level of ability more 
generously. It makes sense for low-ahility people to emigrate when they 
can reasonably think that the United States not only pays better for the 
same work but protects them against poor labor market outcomes (in 
comparison to their birth country) with welfare payments and other etv 
titlements. 61 ln other words, the United States may be expected to draw 
high-ability workers from countries that have more extensive welfare 
States and less income inequality than the United States (such as West' 
em Europe), and will draw low-ability workers from countries that have 
less extensive welfare States and higher income inequality (such as the 
poorer countries of the Third World). 

Borjas used census data from 1970 and 1980 to examine the experi- 
ence of immigrants from forty-one countries. In his analysis, he holds 
constant the individual immigrants schooling, age, marital Status, 
health, and the merropolitan area where the immigrant settled. By hold- 
ing completed schooling constant, Borjas also factored out some ot the 
influence of cognitive ability. But the educational Systems in the non' 
European countries of origin (where we will focus our attention) are 
much less efficient at identifying talent than the American educational 
System; many bright immigrants have little formal education. We may 
think of the unmeasured residual that Borjas did not hold constant as a 
cluster of personal and cultural qualities, among which is some role for 
cognitive ability. With this in mind, the Borjas data reveal two impor¬ 
tant findings. 

ln the 1960s and 1970s, America became much more of a welfare 
state. Consistent with that, the eamings potential of the Latino immi- 
grant group feil substantially from 1955 through 1980. Among the non- 
European countries, three of the four steepest declines in earnings 
potenrial were among immigrant groups from Colombia, the Domini- 
can Republic, and Mexico, all large contributors to the Latin American 
immigrant population. Many of the other countries were not included 
in Borjass forty-one countries, so we do not know whether they followed 
the same pattem. Among the Latin American and Laiino-Caribbean 
nations, only the immigrant groups from Cuba, Brazil, and Panama had 
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improvmg potential by Borjass measures. The 1980 Mexican wave of 
immigrants had an earnings potential about 1 5 percent lower than the 
wave that arrived in 1955. For the Dominiean Republic and Colombia, 
the earnings potential of the 1980 wave was more than 30 percent lower 
than those who caine in 1955, a decline that remains after holding ed¬ 
ucation, marital Status, age, and location constant. 62 

Sirmlarly, the success of the early vvaves of West Indian blacks seems 
unlikely to repeat itself. In his book Ethnie America, Thomas Sowell de- 
senhed the successes of West Indian black immigrants, starting from 
early in the twentieth Century, noting among other things that, by 1969, 
second-generation West Indian blacks had a higher mean income than 
whites/’* His account has since become widely cited as evidence for 
everything from the inherent equality of black and white earning abil¬ 
ity to the merits of unrestricted immigration. The Borjas data include 
three of the major contributors of black immigrants from that region: 
Jamaica, Haiti, and Trinidad/Tobago. The earnings potential of the im¬ 
migrant cohorts from these countries in 1970 ranged from 31 to 34 per- 
cent less than American natives (after hold ing education, marital Status, 
age, and location constant). 64 ln 1980, the earnings potenrial from the 
most recent immigrant waves from these three countries ranged from 
26 to 52 percent less than American natives. Immigrants from all three 
countries are on an extremely slow route to income equality, with Ja- 
maicans and Haitians lagging behind everyone except the lowest-rank- 
ing Latin American countries. Borjass study did not include immigrants 
from any countries in sub-Saharan Africa. 

The results for European immigrants were also consistent with the 
theory. Borjas s overall appraisal of the data is worth quoting in full: 

The empirical analysis of the earnings of immigrants from 41 differ¬ 
ent countries using the 1970 and 1980 censuses shows that there are 
streng country-specific fixed effects in the (labor market) quality of 
foreign-born persons. In particular, persons from Western European 
countries do quite well in the United States, and their cohorts have 
exhibited a general incrcase in earnings (relative to their measured 
skills) over the postwar period. On the other hand, persons from less 
developed countries do not perform well in the U.S. labor market and 
their cohorts have exhibited a general decrease in earnings (relative 
to their measured skills) over the postwar period/ 0 
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These analyses should not obscure the energy and ability that we of- 
ten see among immigrants, whether they are Staffing the checkout 
counter at the comer convenience störe or teaching classes in the na- 
tion’s most advanced research centers. The observations of everyday Life 
and the Statistical generalizations we have just presented can both he 
trete at the same time, however. 

HOW IMPORTANT IS DYSGENIC PRESSURE? 

Putting the pieces together—higher fertility and a faster generational 
cycle among the less intelligent and an Immigrant popularion that is 
probahly somewhat helow the native-hom average—the case is strong 
that something worth worrying about is happening to the cognitive Cap¬ 
ital of the country. How big is the effect? If we were to try to put it in 
terms of IQ points per generation, the usual metric for such analyses, it 
would be nearly impossible to make the total come out to less than one 
point per generation. It might be twice that. Rut we hope we have cm- 
phasized the complications enough to show why such estimates are only 
nvarginally useful. Even if an estimate is realistic regarding the current 
Situation, it is impossible to predict how long it may be correct or when 
and how it may change. It may shrink or grow or remain stähle. De- 
mographers disagree about many things, but not that the furrher into 
the future we try to look, the more likely our forecasts are to he wrong. 

This leads to the last issue that must be considered betöre it is fruit- 
tul to talk about specific demographic policies. So what if the mean IQ 
is dropping by a point or two per generation? One reason to worry is that 
the drop may be enlarging ethnic ditferences in cognitive ability at a 
time when the nation hadly needs narrowing differences. Another 
reason to worry is that when the mean shifts a little, the size of the tails 
of the distribution changes a lot, For example, assuming a normal dis- 
tribut.ion, a three-point drop at the average would reduce the Propor¬ 
tion of the population vvith lQs above 120 (currently the top decile) by 
31 percent and the proportion with lQs above 135 (currently the top 1 
percent) by 42 percent. The proportion of the population with IQs he¬ 
low 80 (currently the bottom decile) would rise by 41 percent and the 
proportion with IQs helow 65 (currently the bottom 1 percent) would 
rise by 68 percent. Given the predictive power of IQ scores, particularly 
in the extremes of the distribution, changes this large would protoundly 


alter many aspects of American life, none that we can think of to the 
good. 

Suppose we select a subsample of the NLSY, different in only one re- 
spect from the complete sample: We random ly delete persons who have 
a mean IQ of more than 97, until we reach a sample that has a mean IQ 
of 97—a me re three points below the mean of the full sample. 1601 

How different do the crucial social outcomes look? For somc behav- 
iors, not mach changes. Marriage rares do not change. With a three- 
point decline at the average, divorce, unemployment, and dropout from 
the labor force rise only marginally. But the overall poverty rate rises by 
1 1 percent and the proportion of children living in poverty throughout 
the first three years of their lives rises by 1 3 percent. The proportion of 
children horn to single mothers rises by 8 percent. The proportion of 
men interviewed in jail rises by 13 percent. The proportion of children 
living with nonparental custodians, of women ever on welfare, and of 
people dropping out of high school all rise by 14 percent. The propor- 
t ion of young men prevented from working by health prohlems increases 
by 18 percent. 

This exercise assumed that everyrhing eise but IQ remained constant. 
ln the real worid, things would no doubt be more complicated. A Cas¬ 
cade of secondary effect s may make social conditions worse than we sug- 
gest or perhaps not so bad. Rut the overall point is that an apparently 
minor shift in IQ could produce important social outcomes. Three 
points in IQ seem to he nothing (and indeed, they are nothing in terms 
of anders randing an individuals ability), but a population with an IQ 
mean that has slipped three points is likely to be importantly worse off. 
Furthermore, a three-point slide in the near-rerm future is well within 
the realm of possibility. The social phenomena rhat have been so wor- 
risome for the past few decades may in so me degree al ready re fl ec t an 
ongoing dysgenic effect. It is worth worrying about, and worrh trying to 
do something ahout. 

At the same time, it is not impossible to imagine more hopeful 
prospects. After all, IQ scores are rising wirb rhe Flynn effect. The na¬ 
tion can spend more money more effectively on childhood interven- 
rions and improved education. Won’t these tend to keep this 
three-point fall and irs consequences from actually happening? They 
may, but whatever good things we can accomplish with changes in the 
environment would be that much more effective if they did not have to 
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How Would We Know That IQ Has Been Falling? 


Can the United States really have been experiencing falling IQ? Would 
not we be able to see the consequences? Maybe we have. In 1938, Ray¬ 
mond Gatte 11, one of inost illustrious psychometricians of his age, wrote an 
article for the British Journal of Psychology , “Some Changes in Social life in 
a Community with a Falling Intelligence Quotient. 1,67 The article was 
eerily prescient. 

In education, Cattell predicted that academic Standards would fall and 
the curriculum would shift toward less abstract subjects. He foresaw an in- 
crease in “delinquency against society”—crime and willful dependency (for 
example, having a child without being able to care for it) would be in this 
category. He was not sure whether this would lead to a slackening of moral 
Codes or attempts at tighter govemment control over individual behavior. 
The response could go either way, he wrote. 

He predicted that a complex modern society with a falling IQ would 
have to compensate people at the low end of IQ by a “systematized relax- 
ation of moral Standards, permitting more direct instinetive satisfac- 
tions.” 68 ln particular, he saw an expanding role for what he called “fantasy 
compensations.” He saw the novel and the cinema as the Contemporary 
means for satisfying it, but he added that “we have probably not seen the 
end of its development or begun to appreciate its damaging effects on ‘re¬ 
al ity thinking’ habitsconcerned in otherspheres of life”—a prediction hard 
to fault as one watches the use of TV in today’s world and imagines the use 
of virtual reality helmets in tomorrow’s. 6g 

Tuming to political and social life, he expected to see “the development 
of a larger ‘social problem group’ or at least of a group supported, super- 
vised and patronized by extensive state social welfare work.” This, he fore¬ 
saw, would be “inimical to that human solidarity and potential equality of 
prestige which is essential to democracy." 1701 


fighr a demographic head wind. Perhaps, for example, making the en- 
vironment better could keep the average IQ at 100, instead of falling to 
97 because of the demographic pressures. But the same improved envi- 
ronment could raise the average to 103, if the demographic pressures 
would cease. 

Suppose that downward pressure from demography stopped and 
maybe modestly turned around in the other direction—nothing dra- 
matic, no eugenic surges in babies by high-lQ women or draconian mea- 
sures to stop low-IQ women from having habies, just enough of a shift 


so that the winds were at least heading in the right direction. Then im- 
provements in education and childhood interventions need not strug- 
gle to keep us from falling behind; they could bring real progress. Once 
again, we cannot predicr exactly what would happen if the mean IQ rose 
to 103, for example, but we can describe what does happen to the sta- 
tistics when the NLSY sample is alrered so that its subjects have a mean 
of L03. 1711 

For srarters, the poverty rate falls by 25 percent. So does the Propor¬ 
tion of males ever interviewed in jail. High school dropouts fall hy 28 
percent. Children living without their parents fall by 20 percent. WeT 
fare recipiency, hoth temporary and chronic, falls by 18 percent. Chil- 
dren horn out of wedlock drop hy 15 percent. The incidence ot 
low-weighr births drops hy 12 percent. Children in the bottom decile 
of home environmentsdrop hy 13 percent. Children who live in poverty 
for the first three years of their lives drop hy 20 percent. 

The stories of falling and rising IQ are not mirror images of each other, 
in part for Technical reasons explained in the note and partly because 
the effects of ahove- and below-average IQ are often asymmetricaL 1 ^ 1 
Once again, we must note that the real world is more complex than in 
mir simplified exercise. But the basic implication is hard to dispure: 
With a rising average, the changes are positive rather than negative. 

Consider the poverty rate for people in the NLSY as of 1989, for ex¬ 
ample. It stood at 11.0 percent. 17 * 1 The same sample, Jepleted of above- 
97 IQ people until the mean was 97, has a poverty rate of 12.2 percent. 
The same sample, depleted of below-103 IQ people until the mean was 
103, has a poverty rate of 8.3 percent. This represents a swing of almost 
four percentage points—more than a third of the actual 1989 poverty 
problem as represented by the full NLSY sample. Suppose we cast this 
discussion in terms of the “swing.” The figure below contains the indi- 
cators that show the biggest swing. 

A swing from an average IQ of 97 to 103 in the NLSY reduccs the 
Proportion of people who never get a high school education by 43 per¬ 
cent, of persons below the poverty line hy 36 percent, of children liv- 
ing in foster care or with nonparental relatives by 38 percent, of women 
ever on welfare by 31 percent. The list goes on, and shows substantial 
reductions for other indicators discussed in Part II that we have not in- 
cluded in the figure. 

The nation is at a fork in the road. It will he moving somewhere 
within this ränge of possibilities in the decades to come. It is easy to un- 
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The swing in social problems that can result 
from small shifts in the mean IQ of a population 


Change wben the NLSY sample is 
altered so that the mean IQ is... 


103 









Permanent high sehool dropoms 

Men prevented from working by health problems 

Chi Idle n not living with either parent 

Males ever interviewed in jail 

Persons below the poverty line 
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W o m e n e v e r o n w e I f a re 

Women who beeame chronic welfare reeipienis 
Children born out of wedlock, of all live hinhs 
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derstand the historical and social rcasons why nohody wants ro talk 
ahout the demography of inteiligence. Our purpose has heen to point 
out that the stakes are large and that continuing to pretend that there s 
nothing worth thinking about is as reckless as it is foolish. In Part IV, 
wc öfter sume policies to point the eountry toward a brighter deimv 
grapbic future. 


Chapter 16 


Social Behavior and the 
Prevalence of Low Cognitive 
Ability 


J?i t/as chapter, the questum is not whether low cogiiitive ahility cause :s social 
Problems Hut the prevalence of low cogrmive ability among people who have 
those problems , It is an important distinetion. Causa! relationships are com- 
plex and hard to establish de finite ly. The measure of prevalence is more 
straightforward. For most of the warst social frroblems of our time , the peo¬ 
ple u’ho have the Problem are heavily concentrated in the lower pomori of the 
cognmic ability distributum. Any practical solution must therefore he capa - 
ble of succeeding with such people. 


T his chapter brings together the social hebaviors we covered in Part 
II from a new van tage point. The earlier chapters showed that low 
cognitive ability raises the risk of living in conditions or behaving in 
ways that society hopes to change. Now the question concerns preva- 
ience: To what extent does low cognitive ability describe the people thus 
aftlicted?The distinction is more familiär in the medical context. High 
cholesterol may he a risk factor for heart disease, but most people with 
heart disease may or may not have high cholesterol. If most people who 
have heart attacks do not have high cholesterol, then lowering the cho- 
lesterol of those with high levels will not do much to reduce the fre- 
quency of heart attacks in the population at large. Similarly, to the 
extent that low cognitive ability is prevalent among people who have 
the problems we hope to solve, policies that are effective for people with 
low scores should be soughr. 

The entire NLSY sample, including the AsiamAmericans, Amern 
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can Indians, and other ethnic groups rhat have hitherco been excludcd, 
are used he re. The proportions presented in this chapter are represen- 
tative ot America’s national population for an age cohort that was 26 
to 33 as of 1990. 

POVERTY 

In 1989, the official national statistics revealed that 11.1 percent of per¬ 
sons ages 25 to 34 ycars old were poor in that year, virtually idenrical 
with the 10.9 percent below the poverty line in the NLSY sample ages 
25 to 33. So while the NLSY cannot give us a precise figure for overall 
national poverty, there is no reason to think thar the results from it are 
misleading for young adults. This is in preface to the sobering figure that 
follows. 


Forty-eight percent of the poor in 1989 came 
from the bottom 20 percent in intelligence 


Persons in poverty Cumulative 

(bars) (line) 



IQ Decile 
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This graph uses conventions that we follow throughout the chapter: 
The headline gives the pereentage of the population in question (in this 
instance, the poor) in the bottom 20 percent of IQ, and the scale is the 
same for each graph. The bars show the pereentage of the poor popula- 
tion who come from each decile, marked by the scale on the left. If cog- 
nitive ability were irrelevant to poverty, the bars would be of equal 


height, each at just 10 percent. Adding up the percentages in each bar 
from left to right gives the cumulative pereentage, shown by the black 
line and the right-hand scale. For example, the first two deciles add up 
to 48 percent; therefore the black line crosses the 48 percent mark at 
the second har. The cumulative scale is a way of showing what propor- 
tion of poor people fall below any given decile. For example, in the case 
of poverty, almost 80 percent of poor people are in or below the fourth 
decile. If cognitive ability were irrelevant, the line would be a straight 
diagonal from lower left to the upper right. 

In terms of IQ points, the cognitive ability deciles in the figure above, 
as in all the others in the chapter, correspond to the scores in the tahle 
below. The bottom two deciles cut off IQ 87 and below and the top two 


IQ Equivalents for the Deciles 

Decile 

Range 

Median 

Ist 

Under 81 

74 

2d 

81-87 

84 

3d 

87-92 

90 

4rh 

92-96 

94 

5th 

96-100 

98 

6th 

100-104 

102 

7th 

104-108 

106 

8th 

108-11.3 

110 

9th 

113-119 

116 

lOth 

Ahove 119 

126 


deciles cut off IQ 11 3 and above. It may also be useful to recall that most 
College graduates and almost everyone with a Professional degree fall in 
the ninrh and tenth deciles. 

The figure teils us forcefully that poverty is concentrated among those 
with low cognitive ability. The mcan IQ of people below the poverty 
line was 88. A rhird of them came from the very bottom decile; they 
had IQs under 81. Eighty-two percent had below-average lQs. 

HIGH SCHOOL DROPOUTS 

It will come as no surprise to find that most high school dropouts have 
low intelligence. The figure below shows the results for persons who 
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Two-thirds of high school dropouts came 
from the bottom 20 percent in inteiligence 


Permanent high school dropouts Cumulative 

(bars) (line) 



IQ Decile 

drnpped out of school and did not suhsequently ohtain a GED. Overall, 
94 percent of rhose who permanent ly dropped out of school wo re he low 
average in IQ. As we noted in Chapter 6, this disproportion is not ma- 
terially affected hy analyses limited to persons who took the intelligence 
fest before they dropped out, so it cannot be explained by the eflects of 
a lack of schooling on their IQs. 

Those who drop out of school and later return to get their GED are 
markedly helow the mean of those who finish high school in the nor¬ 
mal way, but they are not as severely skewed toward the bottom end ot 
the distribution. Twenty-five percent are in the bottom two IQ deciles, 
and 69 percent are in the bottom half of the distribution. 

MEN AND WORK 

The Employed 

Year-round employment has only a minor assoriation with co^nitive ability. 
The figure helow, based on men who worked fifty-two weeks in 1989, 


makes this point plainly. We italicize it because, although it is consis- 
tent wirb the analysis presented for whites in Chapter 7, we want to 
emphasize that the same result applies across ethnic groups. 

Seventeen percent of the men who worked year round in 1989 
were in the bottom 20 percent of intelligence 

Men who worked 52 weeks in 1989 Cumulative 

(bars) (line) 



IQ Decile 


By and large, men who were employed rhroughout 1982 were spread 
across the full ränge of IQs, with only a minor elevation for those in the 
top four deciles. The mean IQ of year-round workers was 102. Those 
with low IQ have a statistically tougher time in many ways, but they 
contrihute very nearly their full share ro the population of men env 
ployed year round, an important fact to remember as a counterweight 
to most of the other findings in this chapter. 

Nunworkers 

The prototypical member of the underclass in the public imagination is 
a young male hanging out on the streets, never working. This amounted 
to very few men. Only 2.2 percent of NLSY men not in school and not 
prevented from working because of healrh problems failed to work at 
least a week in 1989. But among these 2.2 percent, low cognitive ahil- 
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ity predominated, The figure below, limited to civilian men out of school 
and not physically prevented from working, combines those who said 
they were unemployed and those who said they had dropped out of the 
labor force; their common denominator is that they reported zero weeks 
of working for 1989. The mean IQ of men who did not work at all was 
84. Fifty percent were in the bottom decile. Eighty-four percent were 
below average. 

Sixty-four percent of able-bodied men who did not work in 1989 
were in the bottom 20 percent of intelligence 


Able-bodied men who did not work Cumulative 



Ist 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 


IQ Decile 

Unemployment 

Now we turn to the men not represented in either of the two figures 
above: men who worked at least some time during 1989 but were out of 
work for more than four weeks. There was somewhat more unemploy- 
ment among the lower deciles of IQ, as the figure below shows, but, as 
the almost straight diagonal line shows, the relationship was not strong. 
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Twenty-nine percent of able-bodied men who were temporarily out 
of work in 1989 were in the bottom 20 percent of intelligence 

Men out of work 4 wks. or more Cumulative 

(bars) (line) 



st 2nd 3rd 4th 5th 6th 7th 8th 9th lOth 


IQ Decile 

For these men, the mean IQ was 97, three points below average. If we 
were to add another graph, for men who were out of work for six months 
but not the full year, it would show a strenger relationship, about haltway 
between the graph just above and the variier graph for men who were 
out of the labor force all year. The general principle is that the longer 
the period of unemployment, the more prevalent is low IQ. Short-term 
unemployment is not conspicuously characterized by low IQ; long-term 
unemployment is. 

MEN AND CRIME 

The next figure contains the breakdown of the lQs of men in the NLSY 
who were inrerviewed in a correctional facility, showing that they had 
commltted at least one offenst' serious enough to get them lock cd up. 
The mean IQ of men who were ever interviewed in a correctional fa¬ 
cility was 84. Forty-five percent were concentrated in the bottom decile 
of cognitive ability. Ninety-three percent of the men were somewhere 
in the bottom half of the cognitive ability distribution. This high preva- 
lence of low IQ among offenders is consistent wirb other estimates in 
the literature, as summarized in Chapter 11. 
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Sixty-two percent of men ever interviewed in jail or 
prison came from the bottom 20 percent of intelligente 


Men ever interviewed in jail Cumuiative 

(bars) (line) 
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WOMEN AND WELFARE 

We Start with women who have ever received welfare. The data are 
shown in the figure below. Overall, the mean IQ of women who ever 
received welfare was 89. About 85 percent of them were below average 
in IQ, and fewer than 4 percent had lQs in the top two deciles. 

For chronic welfare recipients, defined as women who had received 
welfare for at least five years by 1990, the cognitive distribution was 
even lower.* 11 As the figure shows, 57 percent of chronic welfare moth- 
ers were in the bottom two deciles of IQ, 88 percent were in the bot- 
tom half of the distribution, and their mean IQ was 86. Just as low IQ 
was increasingly prevalent as the level ot male unemployment increased, 
so also is low IQ more prevalent among mothers as their dependency 
on welfare rises. 
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FortyTive percent of women who ever received welfare 
are in the bottom 20 percent of intelligence 


Wornen ever on welfare Cumuiative 
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Fifty-seven percent of chronic welfare recipients 
are in the bottom 20 percent of intelligence 



IQ Decile 
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OUTCOMES FOR CHILDREN 

In this section, we describe the prevalence of'Iow IQ among the moth- 
ers of children with various Problems, That is, we are presenting an aiv 
swer to the question, “If I am trying to deal with a certain problem 
regarding the children of young adults, what can I assume about the In¬ 
tel ligence of their mothers?” 

We begin with the overriding fact that, as of 1990, the NLSY moth- 
ers as a group were markedly below average in IQ. Their mean IQ was 
95.7. Fourteen percent of NLSY children were born to mothers in the 
bottom decile of IQ; 27 percent to mothers in the bottom twodeciles; 62 
percent to mothers in the bottom half of the distribution. Thus, for ex^ 
ample, a problem involving NLSY children will “ordinurily" show that 62 
percent of the children have mothers with below-average IQ. As will be 
dear, the observed proportions of low-lQ mothers are often considerahly 
elevated above that expectation. [2] But these benchmark figures must be 
kept in mind when interpreting all the analyses involving children. 

lllegitimacy 

We Start with the children who are born to unmarried women (see the 
figure below). The mean IQ of mothers of children born out of wed lock 

Fifty-two percent of illegitimate children were born 
to mothers in the bottom 20 percent of intelligente 


Children born out of wedlock Cumulative 

(bars) (line) 



was 87. m Of all illegitimate children in the NLSY sample, almost one 
out of three was born to a mother in the bottom 10 percent of the in¬ 
telligente distribution, with an IQ under 81, and 85 percent were born 
to women in the bottom half of the cognitive ability distribution. 

Restricting the analysis to those children who are most at risk, these 
percentages, already extreme, becoine even more bunched at the lower 
end of the distribution. Consider children who fit the archetype of the 
child at risk: born to a poor, single, teenage girl (with poverty measured 
in the year prior to giving birth). Almost two out of three (64 percent) 
of such children were born to women in the bottom 20 percent of the 
cognitive ability distribution. N inety-five percent of them were born to 
women in the bottom half. 

Othcr Fonn.s of Single Parenthood 

The figure below shows the proportion of NLSY children born to a mar- 
ried couple but living (in 1990) with just their mothers because of di- 
vorce or Separation. First, a caution: The profile we are about to present 


Thirty-one percent of children living with divorced or separated mothers 
had mothers with IQs in the bottom 20 percent of intelligence 


Children of divorced or separated parents Cumulative 
(bars) (line) 
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may change in the future beeause so many of the expecred divorces 
among the NLSY sample have not yet occurred. For women who had 
ever been married in the 25 to 33 age ränge as of 1990, we may, how- 
ever, ask: Among their children wbo were living in mother-only fami- 
lies as of 1990, what is the distribution of the mothers intelligente? 

Divorced and separated mothers averaged an IQ of 93. 141 More than 
hall of all children living with their divorced or separated mothers in 
the NLSY were born to women in the bottom 30 percent of the IQ dis¬ 
tribution. Seventy-six percent were born to women in the bottom half 
of the distribution. Remember that there is no contounding with ille* 
gitimacy; all children born out of wedlock are excluded from this sam¬ 
ple. The prevailing notion that Separation and divorce are so endemic 
that they aff ec t everyone more or less equally is wrong as regards cog- 
nitive ahility, at least in this age group. 

Perhaps the differences will even out to some extent in the long run. 
Brighter women get married and have their children later, ln the NLSY 
sample, their marriages have had less time to break up than those tor 
women lower in the distribution. Only time will teil whether and how 
rnuch the distribution in the graph above will change in the years to 
come. At this point, the skew is notable and clear. 

Pulling together the data on illegitimacy and other torms of single 
parenthood, here are a few key points: 

• Within the bottom two deciles of intelligence, illegitimacy is more 
common than divorce or Separation as the source of single 
parenthood. 

• Beginning with the third decile, divorce and Separation become 
an equal or predominant source of single parenthood. 

• The bottom half of the cognitive ahility distribution accounts for 
82 percent of all children in single-parent homes (comhining ib 
legitimacy with divorce or Separation) as of 1990. 

Lou^Birth-Wcig/u Babies 

Among whites, the chances of having a low-birth-weight baby were as- 
sociated with IQ, not socioeconomic background, when both variables 
were taken into ac.count (Chapter 10). The prevalence of low-birth- 
weight babies among women in the bottom half of the distribution per- 
sists when the entire NLSY sample is considered (the figure helow). 


Forty-five percent of low^birth-weight babies had 
mothers in the bottom 20 percent of intelligence 


Low-birth-weight babies Cumulative 

(bars) (line) 



IQ Decile 


Mothers with low-birth-weight babies averaged an IQ of 89. Almost 
three out of fnur (74 percent) mothers were in the bottom half of the 
IQ distribution. 

Deprived Home Environment s 

Chapter 10 discussed the HOME inventory, a measure comhining many 
indicators of both emotional support (for example, disciplinary style) 
and cognitive Stimulation (for example, reading to the child). Here, w ; e 
examine children whose HOME scores put them in the bottom 10 per¬ 
cent of environments (using national norms for the HOME inventory). 

The rnean IQ of mothers of children in the worst home environments 
was 86. Three out of eight had lQs below 81; 86 percent had IQs helow 
100. The figure helow combines the results for children in all age groups. 
There were some age differences, however: Generally, the concentra- 
tion of the worst environments among mothers with low cognitive ahil¬ 
ity got worse as the children got older. For children ages 3 to 5 who were 
in the worst home environments, 59 percent had mothers with IQs in 
the bottom two deciles. For children 6 and older, the figure was 65 per¬ 
cent. 
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Fifty-six percent of all children from bottom decile in home environ- Sixty-three percent of children who lived in poverty throughout the 

ment were born to mothers in the bottom 20 percent of intelligence first three y ears had mothers in the bottom 20 percent of Intelligence 


Children in the worst home environments Cumulative 
(bars) (line) 
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Children in Poverty 

The proportion of children living in poverty is one of the most fre- 
quently cited statistics in public policy debates and one of the most pow- 
erful appeals to action. In considering what actions might be taken, and 
what will and won't work, keep the following figure in mind, lt shows 
the distribution of maternal cognitive ability among children who spcnt 
their first three years below the poverty line. Mothers whose children 
lived in poverty throughout their first three years averaged an IQ of 84. 
Forty-one percent had mothers in the very bottom decile in cognitive 
ability. In all, 93 percent were born to women in the bottom half of the 
IQ distribution. Of all the social problems examined in this chapter, 
poverty among children is preeminently a problem associated with low 
IQ—in this case, low IQ among the mothers. 

Developmental Problems Among Children 

The prevalence of developmental problems among children is skevved 
toward the lower half of the IQ distribution. Rather than present graphs 
for each of them, the table below summarizes a consistenr Situation. See 


Children living in poverty for Ist 3 yrs. Cumulative 
(bars) (line) 
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Children with Developmental Problems 
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42 
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Chapter 10 for a description of the indexes. Low IQ is prevalent amori^ 
the mothers ot children with each of these developmenrai problems, hur 
none shows as strong a concentration as rhe developmentul indicator 
wc consider che most important: for eventual social adjustment: the 
childs own IQ. The figure below is limited to the cognitive ahility of 
children ages 6 and older when they took the test. 

Seventy-two percent of children in the bottom decile of IQ had 
mothers in the bottom 20 percent of intelligente 


Children in the bottom decile of IQ Cumulative 

(bars) (line) 
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IQ Decile 

The mean IQ ot mothers of children who scored in the hot tom decile 
of a childhood intelligence test was 81. hl Overall, 94 percent of these 
children had mothers with IQs und er 100. The extreme concentration 
of low IQ among the children of lowTQ mothers is no surprjse. Thar it 
is predictable does not make the future any brighter for these children. 
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CONCLUDING REMARKS 

Let us conclude on a brighter note, after so unrelenting a tally of Prob¬ 
lems. You will recall from Chapter 12 that we developed a Middle Class 
Values Index. To qualify for a score of “yes," an NLSY person had to he 
married to his or her first spouse, in the labor force (if a man), Hearing 
children within wedlock (ifa woman), and neverhave been interviewed 
in jail. How did the NLSY sample break down by IQ? The results are 
set out in the figure. 


Ten percent of people scoring “yes” on the Middle Class Values 
Index were in the bottom 20 percent of intelligence 


People scoring “yes” on the MCV Index Cumulative 
(bars) (line) 
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The mean IQ of those who scored “yes” was 104- Those in the bot- 
tom rwo deeiles conrributed only about 10 percent, halt of their pro¬ 
portional share. Those in the bottom half of the cognitive distribution 
contribured 37 percent. As in the case of year-round employment, the 
skew toward those in the upper half of the cognitive ahility distribution 
is not extreme. This reminds us again more general ly that most people 
in the lower half of the cognitive distribution are employed, out of 
poverty, not on welfare, married when they have their bahics, provid- 
ing a nurturing environment for their children, and obeying the law. 
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We must add another reminder, however. Thcre is a natural tendency 
to review these figures and conclude that we are real ly looking at the 
consequences of social and economic disadvantage, not intelligence. 
But in Part II, we showed that for virtually all of the indicators reviewed 
in this chapter, Controlling for socioeconomic Status does not get rid of 
the independent impact of IQ. On the contrary, Controlling for IQ oh 
ten gets rid of the independent impact of socioeconomic Status. We have 
not tried to present the replications of those analyses for all ethnic 
groups combined, but they teil the same story. 

The lesson of this chapter is that large proportions of the people who 
exhibit the behaviors and problems that dominate the nation’s social 
policy agenda have limited cognitive ability. Often they are near the de- 
finition for mental retardation (though the NLSY sample screened out 
people who fit the clinical definition of retarded). When the nation 
seeks to lower unemployment or lower the crime rate or induce welfare 
mothers to get jobs, the Solutions must be judged by their effectiveness 
with the people most likely to exhibit the problem: the least intelligent 
people. And with that, we reach the practica! questions of policy that 
will occupy us for the rest of the hook. 


PART IV 

Living Together 

Our analysis provides few clear and decisive Solutions to the major do¬ 
rnest ic issues of the day. But, at the same time, there is no major do¬ 
rnest ic issue for which the news we bring is irrelevant. 

Do we want to persuade poor single teenagers not to have babies? The 
knowledge that 95 percent of poor teenage women who have babies are 
also below average in intelligence should prompt skepticism about 
Strategien that rely on abstract and far-sighted calculutions of self-inter- 
est. Do we favor job training programs for chronically unemployed men? 
Any program is going to fail unless it is designed for a target population 
half of which has lQs below 80. Do we wish to reduce income inequal- 
ity? If so, we need to understand how the market for cognitive ability 
drives the process. Do we aspire to a “worid dass'’ educational System 
for America? Before deciding what is wrong with the current System, we 
had better think hard ahout how cognitive ability and education are 
linked. Part IV t.ries to lay out some of these connections, 

Chapter 17 summarizcs what we know about direct efforts to increase 
cognitive ability by altering the social and physical environment in 
which people develop and live. Such efforts rway succeed eventually, hut 
so far the record is spotty. 

Chapter 18 reviews the American educational experience of the past 
few decades. It has been more successful with the average and below- 
average Student than many people think, we conclude, hut has ne- 
glecred the gifted minority who will greatly affect how well America 
does in the twenty-first Century. 

In Chapters 19 and 20, the focus shifts to affirmative action policies 
in education and in the workplace. Our society has dedicared irself to 
coping with a particular sort of inequality, trying to equalize ourcomes 
for various groups. The country has retreated from older principles of 
individual equality before the law and has adopted policies that trear 
people as members of groups. Our contribution (we hope) is to calibrate 
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the policy choices associated with affirmative action, to make costs and 
benefits clearer than they usually are. 

The final two chapters look to the future. In Chapter 21, we sound a 
tocsin. Predictions are always chancy, and ours are especially glum, hut 
we think that cognitive stratification may be taking the country down 
dangerous paths. Chapter 22 follows up with our conception of a liberal 
and just society, in light of the story that the rest of the book has told. 
The result is a personal Statement of how we helieve America can face 
up to inequality in the 2Ist Century and remain uniquely America. 


Chapter 17 

Raising Cognitive Ability 

Rüisrng irueüigence sigrii/icantly, consistendy, and affordably would circum- 
vent many of the Problems diat we have described. Furthermore , che needed 
emaromnema! irn/rrovemern.s —better nutrition , stimulating environments for 
preschool children, good schools thereafter—seern ohvious. But raising int ei- 
ligence is not easy. 

Nutrition may oßer one of the more promising approaches. Height and 
weight have increased markedly wich better nutrition . Tfie rising IQs in many 
countries suggest that better nutrition may be increasing intelhgence too. Com 
trolled Studies have made some progiess in uncoveringa link hetween impi'uved 
nutrition and elevated cognitive ability as well , hut it remains unproved and 
not well understood. 

Forma! schoo/inq offers httle hope of narrowing cognitive inequality on a 
large scale in deieloped countries , be cause so much of its potential contribu- 
tion has already been realized with the advent of universal twelve-year Sys¬ 
tems. Special l>rngrams to improve Intelligence mithin the school have had 
Tnmor arul probably temporary effccts on mteüigence. There is more to be 
gained from educational research to find new methods of instruedon than from 
more interventions of the type already tried. 

Preschool has bome many of the recent hopes for improvmg intelligence. 
Howei'er, Head Start, the largest program, does not improve cognitive func- 
tionmg. More intensive, hence more costly, Irreschool programs may raise in- 
telligence f but both the size and the reahty of the nnfrrovements are in dispute. 

The one Intervention that works consistendy is adoption at birth from a had 
family environment to a good one. The average gairn in childhood IQ associ¬ 
ated with adoption are in the region ofsix points —not spectacular but not neg- 
ligible either. 

Taken tage ther, the story of attempts to raise intelligence is one of high 
hopes , flamhoyant claims, and disappointing results. For the foreseeable fu¬ 
ture, the Problems of low cognitive ability are not going to be solved by out- 
side interventions tu make children smarter. 
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C an people become smarter if they are given the right kind of Help? 

If raising intelligence is possible, then the material in Parts II and 
III constitutes a clarion call for programs to do so. Social prohlems are 
highly concentrated among people at the bottom of the cognitive dis- 
tribution; those problems become much less prevalent as IQ increases 
even modestly; and the history of increases in IQ suggests that they oc- 
cur most readily at the bottom of the distribution. Why not mount a 
major national effort to produce such increases? It does not appear on 
its face to be an impossible task. Even the highest estimates of heri- 
tability leave 20 to 30 percent of cognitive ability to be shaped by the 
environment. Some researchers continue to argue that the right Pro¬ 
portion is 50 to 60 percent. ln either case, eliminating the disadvan- 
tages that afflict people in poor surroundings should increase their 
cognitive functioning. 111 

Upon first consideration, the ways to eliminate those disadvantages 
seem obvious. Many children of low-income parents grow up in terrible 
home environments, with little Stimulation or nurturing. Surely, it 
would seem, intelligence would rise if these children were placed in day 
care environments where Professionals provided that Stimulation and 
nurturing. Schools in poor neighborhoods are often run down and 
chaotic. Isn’t it clear that increasing the Investment in schools would 
pay off in higher scores? 

Limitless possibilities for improving intelligence environmentally 
wait to be uncovered by Science: improved educational methods, diets, 
treatments for disease, prenatal care, educational media, and even med¬ 
ianes to make one smarter, ln principle, intelligence can be raised en¬ 
vironmentally to unknown limits. 

Yet the more one knows about the evidence, the harder it is to he 
opnmistic about prospects in the near future for raising the scores of 
the people who are most disadvantaged by their low scores. For one 
thing, it is hard to find new ways to use existing resources that are not 
already being done. The nurturing of the young—including the cog¬ 
nitive nurturing—is one of the central purposes of human society. 
That, after all, is what families mainly do. Very high proportions of 
children already get prenatal care, nutrition, home environments, and 
classroom environments that are good enough to leave little room for 
measurable improvement. The grim stories about childhood depriva- 
tion involve a small proportion of children. And when it comes to 
helping that small proportion of children, the results seldom approach 


expectations. We may be deeply and properly dissatisfied with the nur- 
turing of American intelligence, but finding Solutions that are afford- 
ahle, politically tolerable, and not already heing tried is another matter 
altogether. 

In this chapter, we move through a succession of topics. First we con- 
sider the effects of nutrition. We then discuss a sequence of successively 
more targeted, intense social interventions: education in general, 
preschool interventions, intensive Support for children at risk for retar- 
dation, and the most extreme form of social Intervention, adoption at 
birth. We close with our thoughts on what society s experiences with 
these interventions should mean for policy in the future. 

NUTRITION 

Most of us have heen urged by a patent or grandparenr to eat the “brain 
food,” which seemed invariably to be the most unpalatable thing on the 
table. This idea of a Connection between diet and intelligence has an 
ancient history going back to mens sana in corpora .sano. ul In the twen- 
tieth Century, the plausihility of a connection has heen reinforced by the 
facr that people in affluent countries are larger than their ancestors 
were, presumahly in parr because they are eating better. IQ scores, too, 
have heen rising during approximately the same period—the Flynn 
effecr descrihed in Chapter 13. These coincident cbanges do not prove 
that better eating makes for smarter people, but count as circumstantial 
evidence. 

For a while, however, scientific research seemed to have weakened 
the case for any link between nutrition and IQ. The most damaging 
blow was a study of over 100,000 Dutch men who were born around a 
time of intense famine in several Dutch cities near the end of World 
War II.* Nineteen years later, the men took intelligence tests as part of 
the qualification for national military Service, and it occurred to schol- 
ars to compare the ones who were born in the depths of the famine to 
those born just before and just after it. Many pregnant women miscar- 
ried during the famine, but their surviving sons scored no lower in in¬ 
telligence than the men born to mothers who had little or no exposure 
to famine. But as important as this study was, some scientists were not 
entirely convinced by its negative findings. The Dutch famine was rel- 
atively brief—three months or so—and limited to the pre- and perina- 
tal period of the mens lives. And while the mothers were indeed 
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starving for calories, their deficiencies in Vitamins, minerals, and other 
dietary elements were perhaps too brief to take a toll. 4 

AnotKer approach to the impact of nutrition on cognitive ability is 
to see whether enriched diets can raise scores. A breakthrough study 
done in Great Britain in the late 1980s concluded that the ans wer was 
yes. 5 David Benton and Gwilym Roberts gave a sample of thirty Welsh 
12- to 13-year-old children vitamin and mineral Supplements for eight 
months and compared their test scores with an equal number of their 
schoolmates getting nonnutritive placebos. The Welsh children were 
not known to be malnourished, but those getting the Supplement gained 
eight points more in their nonverbal intelligence test scores than those 
getting the placebo, a large and statistically significant improvemenr. 
Verbal scores showed no differential improvement. t6) 

A recem American confirmation of the Welsh results gave over 600 
eighth and tenth graders in several California schools daily pills for thir- 
teen weeks/ The pills contained either half the recommended daily ah 
lowances (RDA) of a wide assortment of Vitamins and minerals, 
precisely the RDA, twice the RDA, ora placebo. The vitamin and min¬ 
eral Supplement raised scores on most of the nonverbal subtests of a Stan¬ 
dard intelligence test. [s| The verbal intelligence test scores again faiied 
to register any benefit, but that is consistent with the Flynn effect: The 
rising average intelligence scores of nations seem primarily to he on non¬ 
verbal tests. 

The net average benefit for pills providing one RDA was about four 
points in nonverbal intelligence in the California study. But rhis aver¬ 
age gain comprised many youngsters who did not benefit at all, mixed 
with some whose gains exceeded fifteen points. The children who did 
not benefit were presumably already getting the vitamins and minerals 
they needed for developing their nonverbal scores in their regulär diets. 
But this is just a hypothesis at present, lt remains to be shown whether 
the gain from vitamins or minerals can be associated with preexisting 
food deficiencies, let alone which particular dietary ingredients, in what 
amounts, produce the gains.Youngsters getting exactly the RDA had 
the largest gain in scores; those taking either more or less of the Sup¬ 
plement benefited less, Lf at all. 1101 This is not only puzzling but worri- 
some. Could it mean that excessive dosages of vitamins and/or minerals 
harm intellectual functioning? There is no evidence that it does, but at 
the least, it reinforces the prudence of doing more research hefore go- 
ing overboard for vitamin and mineral Supplements. 


Other Physiologieal Influences on IQ. Or Are They? Two Further 

Examples 

The physiologieal eovironment. seems to be associated with IQ in other 
ways. For example, some studies (but not all) have found a small decline 
in IQ of each successive child born to a given woman, even after holding 
overall family size constanr. |li!| Is rhis a matter of the social environment 
within the family, which changes as new children enter it, or the physio- 
logical environment in the uterus, which is hoth older on average and has 
a longer hisrory of childbirth with each successive pregnancy ? The answer 
is unclear, and both views have been advanced. But, whichever it is, this 
would be a genuine environmental effect on intelligence, sinee the rolls of 
the generic dice for the successive offspring of a given mother and father 
are independent as far as anyone knows. 

AnotKer environmental and possihly physiological influence on IQ is 
suggested by data from twins. Among identical rwins, the one with the 
higher IQ is likely to have been heavier at birth. 1 ' This is parr of a more 
general finding that higher weights at birth are associated with higher IQs 
in childhood, hur the identical twin data decisivelv prove that the corre- 
lation between birth weight and later intelligence has an environmental 
elemenr, since identical twins are genetic clones. 14 It is less certain that 
there are no social factors here: People may treat twin hahies different ly if 
one is plumper than the other. Training mothers in how ro be more arten- 
rive to their low-birrh-weight hahies seems, in fact, to raise later IQ, at least 
up to the age of 7."^' 


This caution is reinforced by the inconsistency of the nurritional ef¬ 
fect on IQ. Many studies that seem to he well-conducted variations of 
the successful ones have faiied to demonstrate any effect on IQ at all. 11 
The reasonable middle ground at this point is to conclude that provid¬ 
ing children with the recommended daily allowance of vitamins is a 
good idea for many reasons and might also have a helpful effect on IQ. 

RAISING IQ THROUGH BETTER EDUCATION 

The almosr reflexive reaction of most people when they hear ahout the 
below-average test scores among children in the bottom of the socioe- 
conomic distribution is that of course they have low scores because they 
have gotten poor educations. Improve the schools, it is assumed, and 
the scores will rise. 
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There are a numher of prob]eins wich this assumption. One basic er¬ 
ror is to assume that new educational opportunities that suecessfully 
raise the average will also reduce differences in cognitive ability. Con- 
sider trying to raise the cognitive level by putting a public library in a 
community that does not have one. Adding the library could increase 
the average intellectual level, but it may also spread out the ränge of 
scores by adding points to the IQs of the library users, who are likely to 
have been at the upper end of the distribution to begin with. The lirer- 
ature on such U aptitude-treatment interacrions” is large and complex. 16 
For example, providing Computer assistance to a group of elementary 
school children leaming arithmetic incrcased the gapbetween good and 
bad students; u a similar effect was observed when Computers were used 
to teach readingQ the educational television program, “Sesame Street” 
increased rhe gap in academic performances between children from 
high- and low-status homes. w These results do not mean that such in- 
terventions are useless for the students at the bottom, but one must be 
careful to understand what is and is not being improved: The perfor- 
mance of those at the bottom might improve, but they could end up 
even further behind their brighter classmates. 

A second broad difficulty with relying on improvements in educa- 
tion is that although they make some difference in IQ, the size of the 
effect is small. This conclusion is supported by evidence from hoth nat¬ 
ural Variation in education and planned educational experiments. 

Looking at Natural Variation 

Patents huying new houses often pick the neighborhood according to 
the reputation of the local schools. Affluent parents may spend tens of 
thousands of dollars to put their children through private schools. Teil 
parents that the quality of the schools doesn’t matter, and they will 
unanimousty, and rightly, ignore you, for differences in schools do mat¬ 
ter in many important ways. But in affectmg IQ, they do not matter 
nearly as much as most people think. 

This conclusion was first and most famously reached hy a study that 
was expected to demonstrate just the opposite. The study arose out of a 
mandate of the Civil Rights Act of 1964 to examine how minority 
groups are affected by educational inequalities. The resulr was a huge 
national survey, with a sample that eventually numbered 645,000 stu¬ 
dents, led by the eminent sociologist James S. Coleman. His researchers 
measured school quality by such ohjective variables as credentials of the 


teachers, educational expenditures per pupil, and the age and quality of 
school fadlities. 

Recause the schools that most minority children attended were mea- 
surably subpar in facilities and staff, it was assumed that the minority 
children fortunate enough to attend better schools would also show im¬ 
proved cognitive functioning. But the report, issued in July 1966, an- 
nounced that it had failed to find any benefit to the cognitive abilities 
of children in public primary or secondary schools that could be cred- 
ited to better school quality. 1201 The usual ways in which schools tried to 
improve their effectiveness were not likely to reduce the cognitive dif¬ 
ferences among individual children or those between ethnic groups. 

The Coleman report’s gloomy conclusions were moderated in subse- 
quent analyses that found some evidence for marginal benefits of school 
quality on intellectual development. 1211 Coleman himself later con- 
cluded that parochial schools generally do a better job of developing the 
cognitive abilities of their students than public schools, which pointed 
to at least some factor in schooling that might be exploited to improve 
intelligence. 22 Yet rhe basic conclusion of the report has stood the test 
of time and criticism: Variations in teacher credentials, per pupil ex¬ 
penditures, and the other objective factors in public schools do not ae- 
count for much of the Variation in the cognitive abilities of American 
school children. 1231 

The several hundred thousand children assessed in the Coleman 
study had not heen subjects in educational experiments. They were just 
students in several thousand local schools. The schools varied in qual¬ 
ity, as they inevitably will. 24 Some schools, usually in prosperous urban 
or suburhan districts, got (and still get) more money, more teachers with 
better qualifications, newer school buildings, and the like. Poorer or 
rural districts usually made (and make) do with less. The Coleman re¬ 
port, in other words, is one ofaspecicsof educational research thatdraws 
on natural Variation —Variation that is occurring spontaneously rather 
than by design. 

Looking at the effects of natural Variation has advantages as a re¬ 
search strategy. One is that this kind of research does not require new 
Investments of time and money to intervene in schools. The interven- 
ing has already been done at someone eise’s expense. The disadvantage 
of such studies is that the Variation is often narrow—an example of the 
restriction of ränge problem that we described in Part I. If almost all 
classes have, say, between twenty-five and thirty-five children in them, 
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then looking at natural Variation cannot reveal what would happen in 
classes with five or ten children in them. The Coleman report did not 
prove that educational reform is always futile, but that, on the whole, 
America had already achieved enough objecrive equalization in its 
schools by 1964 so that it was hard to pick up any effects of unequal 
school quality. The Coleman report teils us that the cognitive ahility 
differences among individuals and groups alike on a national scule can¬ 
not he reduced much hy further attempts to equalize the kinds of bricks- 
and-mortar factors and teacher credentials that school boards and 
taxpayers most often concern themselves with. 

Aside from the issue of school quality is the question of whether sim- 
ply going to school makes any difference to one s intelligence. The an- 
swer is self-evidently yes. Going to school and learning how to read and 
write, manipulate numbers, find out about one s culture and about the 
discoveries of Science are going to raise scores on IQ tests compared to 
not going to school. But although it is obvious that schooling itself fos- 
ters intelligence, it is far less obvious how much of the intellectual Vari¬ 
ation around us can be attributed to differences in the amount of 
schooling people get. If large numbers of people were getting no school- 
ing at all, there would be cognitive disadvantages on a grand scale that 
could be hlamed on a lack of formal education. But in modern coun¬ 
tries, natural Variation does not span so wide a ränge. 

An example of a study that had enough natural Variation in it to find 
an eff ec t of schooling was done in Sweden a half-century agoT IQ tests 
w r ere given Ln 1938 to a representative sample of several hundred 10- 
year-old boys in public and private schools in a Swedish city. Ten years 
later, the boys were tested again as part of an induction examination for 
national military Service, ln addition to the two IQ scores, the boys’ 
home and family hackgrounds and the total years of schooling were 
availahle for analysis. 

The average subject in the study had completed only eight years of 
schooling, which means that many of them had completed fewer. Fewer 
than 10 percent of them had finished high school, and still fewer had 
gone on to university. Compared to present-day Sweden or America, 
the men experienced a wide ränge of years in school. Even st), the main 
determiner by far of IQ at rhe age of 20 was the IQ at the age of 10, by 
a factor of more than five times as important as years of schooling. 1 “'' 1 
On the other hand, schooling was a significant though much weaker 
predictor, aff er Holding IQ at age 10 and family background constant. 


Since there was somc beneficial effect of schooling, the results of the 
study were properly used to argue that additional years of school would 
pay off in higher scores. 

We can infer from the Swedish study that some of the Flynn effect 
around the world is explained hy the upward equalization of schooling, 
hur a hy-product is that schooling in and of itself no longer predicts adult 
intelligence as strongly, assuming it did so when many people were not 
getting much schooling. |J/| The more uniform a country’s schooling is, 
the more correlated the adult IQ is with childhood IQ. 

The average American now gets more than three extra years of 
schooling compared to the time when the earliest intelligence tests were 
given. To he sure, years spent in school still varies in America, and it is 
presumahly still conrrihuting to Variation in cognitive abilities. 12 * 1 But 
given how small the effect was in the Sweden of the 1930s and 1940s, 
it is unlikely to be large in America today, given the enormous com- 
pression of educational Variation in America during the twentieth Cen¬ 
tury (see Chapters 1 and 6). Nevertheless, we accept the hasic premise 
that Variation in the amount of schooling accounts for some portion of 
rhe observed Variation in cognitive ahility. Besides not knowing how 
large rhis remaining effect is, it ishard to estimate how much more would 
be gained on the average by furrher equalization of years of schooling. 
C3ains reaped at the hottom of the cognitive ahility distrihution may he 
paid for hy losses at rhe top, a process we discuss in the next chapter. 

School differences can nonetheless be important. If a child is near 
the top of the intelligence distrihution to hegin with, the school can 
make a major difference in whether thar intellectual talent is actually 
realized, a topic we consider in the next chapter. Or if a child has spe¬ 
cific learning disahilities, access to the latest pedagogical techniquesand 
Technology may make a major difference. There doubtless are, in addi¬ 
tion, pockets in Americas vast educational realm where schools are un- 
commonly good or uncommonly poor, in which rhe children are 
henefiting or suffering cognitively. By definition, however, these are un- 
usual cases, not likely to show up in national data on intelligence. 

This discussion has not meant to imply that the fostering of cogni¬ 
tive ahility is the only rcsult we want from schools. The civility, let alone 
the safety, of the environment may vary widely from school to school. 
Skillful teachers may make learning more interesring. They may infuse 
children with a love of learning to some extent. These are effects worth 
worrying about, but they do not alter the fundamental message that the 
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data convey: Equalizing the amount or ohjective quality of schooling in 
America cannot be counted on to equalize cognitive ability much. 

Compensatory Education 

Just a year prior to the Coleman report, the U.S. Congress passed the 
Elementary and Secondary Education Act (ESEA) of 1965, thereby 
opening a massive and continuing effort to improve the education of 
disadvantaged students that continues to this day. ln the first fiscal year, 
grants for educationally deprived children under Title I of the ESEA 
went from zero to $3 billion, rose to $4 billion in the next year, and have 
remained there, or higher, ever smce. Expenditures in fiscal 1992 were 
at an all-time high ot $5.6 billion (all tigures are in 1990 dollars)T ; 

Sponsors of Title I assumed that these programs would narrow the gap 
in cognitive lunctioning between disadvantaged children and other stu- 
dents. To prove this, the act also funded an aggressive, ongoing evalua- 
tion effort, resulting over the years in a mounting stack of reports. ln the 
mid-1970s, the National Institute of Education (NIE) commissioned a 
synthesis of the results. Reviewing all the federal studies from 1965 to 
1975, researchers found no evidence that students in compensatory ed- 
ucation programs closed the gap with their more able peers. Some plau¬ 
sible data suggested that “students in compensatory programs tend to fall 
behind other students, but not as fast as if they had received no com- 
pensatory instructions,” an outcome that the Institute treated as evi¬ 
dence of success. 30 The greatest Support in the various studies was for a 
simpler u no effect” conclusion: The gap was about as great after compen¬ 
satory education as before. Iul No evidence whatsoever supported a con¬ 
clusion that compensatory education narrowed the achievement gap. 

More optimistically, supporters of compensatory education can call 
upon the evidence of eonverging black-white test scores that we de- 
scribed in Chapter 13 as indirect evidence that somerhing positive has 
been happening in elementary and secondary education for minorities. 
As we described, improvement has been the largest at the bottom of the 
IQ distribution, which in turn points toward compensatory programs as 
a possible cause. Bur direct evidence of the link remains elusive. ln re- 
cent years, compensatory programs have set more modest goals, for 
themselves. 1121 Now, they focus on teaching specific academic skills or 
problem solving, not expecting improvements in overall academic 
achievement or general intelligenceT 


Stories Too Good to Be True 


Accounts of phenomenal success stories in education—the inner-city 
school that suddenly excels as the result of a new program or a new 
teaeher—are a perennial fixture of American journalism. Are they true? If 
the quesrion is whetheran inspirational teaeher or some new program has 
the capacity to make an important difference in students’ lives, then the 
answer is surely yes. But claims for long-term academic improvement, let 
alone incrcases in cognitive functioning, typically fade as soon as hard 
questions heg in to be asked. A case in point is Chicagos Marva Coüins, 
who gained national attention with claims that her sht restring-hudget in- 
ner-ciry school, launched in 1975, was turning out students who blew the 
top off Standard ized tests and were heading to the best, universities. Be¬ 
tween the ages of 5 and 10, she claimed, her pupils, deemed “unreachable” 
in regulär schools, were reading Plato, Aristotie, Chaucer, Shakespeare, 
and Tolstoy, according to stories in the populär media. According to other 
newspaper reports, she was asked by both President* Reagan and Clinton 
to become secretary of education. She continues to train large numbers of 
teachers in her methodsT Are her celebrated aneedotes borne out by data? 
We do not know. Despite years of Publicity about Marva Collins, we can 
find no hard evidence. ^ 

More general ly, the large test score increases in local schools that are 
widely and routinely reported by the media have been plagued by fraud. ln 
several schools in and around Washington, D.C., for example, the Wash¬ 
ington Post reported that gains in test performance were found to be due to 
improper coaching on the tests by school employees or by allowing extra 
time for students to ccunplete the tests.A story in the Los Angeles Times 
told of various methods of cheatingon standardized tests, including the re- 
placing of wrong answers with right ones by teachers and staff, in at least 
fifty elementary public schools statewkleT The New York Times wrote 
about a public school principal who had been caught tampering with Stu¬ 
dent test scores for years.' 8 These specific instances seem ro be part of a 
widespread problem. 


Raising IQ Among the SchooEAged: Converging Results from Two 
Divergent Tries 

The quesrion remains: Is there any evidence that cognitive ability as 
measured by IQ tests can be increased by special interventions after chil¬ 
dren reach school age? We have some reason for thinking the answer is 
a highly qualified yes, and some basis for estimating how much, from 
two sources of evidence drawn from strikingly different contexts. 
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The first is one of the kargest controlled experiments attempting ex- 
plicitly to raise the intelligence of school-age children. It occurred in 
Venezuela, where in 1979 the incoming president named to his cahiner 
a Minister of State for the Development of Human Intelligence. 1401 The 
new minister was convinced that a nation’s average intellecrual level 
was fundamental to its well-being, and he set out to see what could he 
done to raise the IQ of Venezuelan school children. The result was Pro- 
ject Intelligence, designed over four years by a team of Venezuelan and 
American psychologists, educators, and other speeialists. In the fifth 
year, 900 youngsters in seventh grade in a poor district of a Venezuelan 
provincial city were random ly divided into experimental and control 
groups. 1411 Those in the experimental group were taught approximately 
sixty forty-five-minure lessons in addition to their regulär Curriculum 
during the year and were cognitively rested hefore, during, and after the 
year. The students in the control group were tested at the samt intet' 
vals, without receiving any of the additional instruction. The special 
lessons involved instruction in the kinds of intellecrual activities that 
turn up on intelligence tests—visuospatial and verbal reasoning, vo- 
cabulary and word analogies—in addition to lessons in inventive think' 
ing. l4Z1 At the end of the year, the youngsters in the experimental group, 
compared to the Controls,, had gained a net of more than 0.4 Standard 
deviation on a conventional intelligence test and a net gain of just over 
0.1 Standard deviation on a culture-fair intelligence test—in other 
words, a net gain in the ränge between 1.6 and 6.5 IQ points. There was 
no chance to see if the gain faded out or was reflected in the rest of the 
students' academic performance, nor can we even guess how much a 
second or third year of lessons would have accomplished. 

The second source of evidence comes from the unsystematic hut mas¬ 
sive attempt to raise intelligence that goes on in the innumerahle com- 
mereial coaching Services promising to raise SAT scores. Few people 
think of the prep courses in that way. On the surface, it is all ahout get- 
ting into the College of your choice. But raising an SAT is just like rais- 
ing an IQ if the SAT is an intelligence test and, however adroitly the 
current officials of the College Board and the admissions officers in uni- 
versities try to avoid saying so, the SAT is partly an intelligence test. 14,1 

Can the SAT he coached? Yes, hur it is not easy. Everyone who looks 
into this topic immediately hears about students who gained 10C, 150, 
or 200 points on the SAT after a few hours of coaching. The rales may 
even he true, but they need to be averaged with the tales that don’t get 


told about the scores that improve by only a few points—and the scores 
that drop—after spending a few dozen hours and hundreds of dollars on 
a coaching course. Scholars have by now largely sorted out the reality 
hehind the sales pitches. After a furious debate about the issue in the 
late 1970s and early 1980s, the best evidence indicates that the coach¬ 
ing programs which can offer convincing scientific backing for their 
claims consisr not of a few hours of practice but of lengthy training, com- 
parable to going to school full time. 44 ln the best of these analyses, 
Samuel Messick and Ann Jungeblut reviewed the published studies on 
coaching for the SAT, eliminated the on.es that were methodologically 
unsound, and estimated in a regression analysis the point gain for a given 
number of hours spent studying for the test. 4 ' Their estimate of the ef- 
fect of spending thirty hours on either the verbal or math test in a coach¬ 
ing course (including homework) was an average of sixteen points on 
the verbal SAT and twenty-five points for the math SAT Larger In¬ 
vestments in time earn larger payoffs with dinvinishing returns. For ex- 
ample, 100 hours of studying for either test earns an average twenty-four 
points on the verbal SAT and thirty-nine points on the math SAT The 
next figure summarizes the results of their analysis. 

Studying really does help, but consider what is involved. Sixty hours 

The diminishing returns to coaching for the SAT 


Average improvernein in SAT points 
60 - 



Sowrcd: Messick and jungehlur 1981, Figs. 1, 3. 
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of work is not a trivial investment of time, but it buys (on average) only 
forty-one points on the combined Verbal and Math SATs—typically not 
enough to make much difference if a Student is trying to impress an ad- 
missions Committee. Even 300 hours—and now we are talking about 
two additional hours for 150 school days—can be expected to reap only 
seventy additional points on the combined score. And at 300 hours (150 
for each test), the Student is already at the flat part of the curve. Dou¬ 
ble the investment to 600 hours, and the expected gain is only fifteen 
more points. 

Although intended for utterly different purposes, the benefits of the 
Venezuelan program and of SAT coaching schools are remarkably sim- 
ilar. The sixty lessons of the Venezuelan course, representing forty-five 
hours of study, added between . 1 and .4 Standard deviation on various 
intelligence tests. From the figure on SAT coaching, we estimate that 
45 hours of studying adds about .16 Standard deviation to the Verbal 
score and about .23 Standard deviation to the Math score. [46) 

These increases in test scores represent a mix of coaching effects— 
cramming is the process, with a quite temporary effect, that you may 
remember from school days—and perhaps an authentic increase in in¬ 
telligence. We also are looking at short-term results here and must keep 
in mind that whenever test score follow-ups have been available (see 
the next section), the gains fade out. The net result is that any plausi¬ 
ble estimate of the long-term increase in real cognitive ability must be 
small, and it is possible to make the case that it approaches zero. 

Taken together, the negative findings about the effects of natural 
Variation in schools, the findings of no effect except maybe to slow the 
falling-behind process in the evaluations of compensatory education, 
and the results of the Venezuelan and SAT coaching efforts all point to 
the same conclusion: As of now, the goal of raising intelligence among 
school-age children more than modestly, and doing so consistently and 
affordably, remains out of reach. 

HEAD START AND ITS SOMETIMES DISTANT RELATIVES 

Düring the 1970s when scholars were getting used to the disappointing 
results of programs for school-age children, they were also coming to a 
consensus that IQ becomes hard to budge at about the time children go 
to school. Longitudinal studies found that individual differences in IQ 
stabilized at approximately age 6. 47 Meanwhile, developmental psy- 


chologists found that the year-to-year correlations in mental test per- 
formance were close to zero in the first few years of life and then rose to 
asymptotic levels by age 6* 48 These findings conformed with the intu¬ 
itive notion that, in the poet’s words, “as the twig is bent the tree’s in- 
clined.” [491 Any Intervention designed to increase intelligence (or 
change any other basic characteristics of the child) must Start early, and 
the earlier the better. 50 Here, we will characterize the more notable at- 
tempts to help children through preschool interventions and summa- 
rize the expert consensus about them* 

Preschool Programs for Disadvantaged Children in General 

HEAD Start* One of the oldest, largest, and most enduring of the Con¬ 
temporary programs designed to foster intellectual development came 
about as the result of the Economic Opportunity Act of 1964, the open- 
ing salvo of Lyndon Johnsons War on Poverty. A year later, the man- 
dated executive agency, the Office of Economic Opportunity, launched 
Project Head Start, a program intended to break the cycle of poverty by 
targeting preschool children in poor families* [51] Designed initially as a 
summer program, it was quickly converted into a year-long program pro- 
viding classes for raising preschoolers’ intelligence and communication 
skills, giving their families medical, dental, and psychological Services, 
encouraging parental involvement and training, and enriching the chil- 
dren’s diets* [52] Very soon, thousands of Head Start centers employing 
tens of thousands of workers were annually spending hundreds of mil- 
lions of dollars at first, then billions, on hundreds of thousands of chil¬ 
dren and their families* 

The earliest returns on Head Start were exhilarating* A few months 
spent by preschoolers in the first summer program seemed to be pro- 
ducing incredible IQ gains—as much as ten points* [53] The head of the 
Office of Economic Opportunity [54] reported the gains to Congress in 
the spring of 1966, and the program was expanded. By then, however, 
experts were noticing the dreaded “fade-out,” the gradual convergence 
in test scores of the children who participated in the program with com- 
parable children who had not* To shorten a long story, every serious at- 
tempt to assess the impact of Head Start on intelligence has found 
fade-out* [551 Cognitive benefits that can often be picked up in the first 
grade of school are usually gone by the third grade* By sixth grade, they 
have vanished entirely in aggregate statistics* 
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Head Starr programs, administered locally, vary great ly in quality. 
Perhaps, some have suggesred, the good programs are raising intelli- 
gence, but their impact is diluted to invisibility in national statisticsa 
That remains possible, but it becomes ever less probable as time passes 
without any clear evidence for ir emerging. To this pennt, no lasting 
improvements in intelligence have ever been statistically validated wich 
cmy Head Start program. Many ot the commentators who praise Head 
Start value its family counselingand public Health benefits, while grant- 
ing that it does not raise the intelligence of the children. 

One response to the disappointment ot Head Start has been ro rede- 
fine its goals. Instead of raising inrelligence, Contemporary advocates 
say it reduces long-term school failure, crime, and illegitimacy and im- 
proves employability.^ 8 * These delayed benefits are called sleeper effects, 
and they are what presumahly justify the frequent public assertions that 
“a dollar spent on Head Start earns three dollars in rhe future,’ 1 or words 
to that effect. |Sg| But even these Claims do not survive scrutiny. The ev¬ 
idence for sleeper effects, such as it is, almost never comcs from Head 
Start programs themselves but from more intensive and expensive 
preschool Interventions. 1(101 

PHRRY PRFSCHOOL. The study invoked most often as evidence that \ lead 
Start works is known as the Perry Preschool Program. David Wcikart 
and his associates have drawn enormous media attention for their study 
of 123 black children (dividecl into experimental and control groups) 
from the inner city in Ypsilanti, Michigan, vvhose IQs measured hetvveen 
70 and 85 vvhen they were recruited in the early 1960s ac the age of 3 
or Fifty-eight children in the program received cognitive instruc- 
tion five half-days 1621 a weck in a high ly enriched preschool seit ing ior 
one or two years, and their homes were visited by teachers weekly for 
further instruction of parents and children. The teacher-to-child ratio 
was high (abour one to five), and mast of the teachers had a masters 
degree in appropriate child development and social work tields. Perry 
Preschool resembled the average Head Start program as a Ferrari re- 
sembles the family sedan. 

The fifty-eight children in the experimental group were compared 
with another sixty-five who served as the control group. By the end of 
their one or two years in the program, the children who went to 
preschool were scoring eleven points higher in IQ than the control 
group. But hy the end of the second grade, they were just marginally 


ahead of the control group. By the end of the fourth grade, no signifi- 
canr difterence in IQ remained. |M1 Fadeout again. 

Although this intensive attempt to raise intelligence failed to pro- 
duce lasting IQ gains, the Ypsilanti group believes it has found evidence 
for a higher likelihood of high school graduation and some post-high 
school education, higher employment rates and literacy scores, lower 
arrest rates and fewer years spent in special education classes as a result 
of the year or two in preschool. The effects are small and some of them 
fall short of Statistical significance. lMl They hardly justify investing bil- 
lions of dollars in run-of-the-mill Head Start programs. 

OtHFR LoNC'.ITUIMNAL STUDIES OF PrHSCHIX)L PROGRAMS. One Prob¬ 
lem faced hy anyone who tries to summarize this literature is just like 
that faced hy people trying to formulate public poliey. With hundreds 
of studies making rhousands of Claims, what can he concluded? We are 
fortunate to have the benefit of the efforts ot a group of social scientists 
known as the Consortium for Longitudinal Studies. lnitially conceived 
by a Cornell professor, Irving Lazar, the consortium has pulled together 
the results of eleven studies of preschool education (including the Perry 
Preschool Project), chosen hecause they represent the best availahle sci- 
entifically.^ None of them was a Head Start program, but a few were 
elahorations of Head Start, upgraded and structured to lend themselves 
to evaluation, as Head Start programs rare ly do. The next figure sum- 
marizes the cognitive outcomes in the preschool studies that the con- 
sorrium deemed suitable for follow-up IQ analysis. The reported changes 
control for pretest IQ score, mothers education, sex, number ot siblings, 
and father presence. 

Soon after completing one of these high-quality experimental 
preschool programs, the average child registers a net gain in IQ of more 
than seven IQ points, almost half a Standard deviation. The gain shrinks 
to four to five points in the first two years after the program, and to about 
three points in the third year. 1<>6i The consortium also collected later fol¬ 
low-up data that led the researchers to conclude that “the effect of early 
education on intelligence fest scores was not permanent. n,6/! 

Intensive Interventions for Children at Risk of Mental Retardation 

The preschool programs we have just descrihed were targeted at disad- 
vantaged children in general. Now we turn to two studies that are more 
intensive than even the ones analyzed hy the consortium and deal with 



406 LivingTogether 


Raising Cognitive Ability 407 


IQ gains attributable to the Consortium preschool projects 


Median gain in IQ points 
8 - 

7- 
6 - 
5- 
4- 
3- 
2 - 
1 - 

Exit lest L year 2 years 3 years 

Period After the End of the Program 

Source; Lazar and Darlington 1982,Table 15. 

children who are considered to be at high risk of mental retardation, 
based on their mothers’ low lQs and socioeconomic deprivation. 

A case can be made for expecting interventions to be especially ef- 
fective for these children, since their environments are so poor that they 
are unlikely to have had any of the benefits that a good program would 
provide. Moreover, if the studies have control groups and are reason- 
ably well documented, there is at least a hope of deciding whether the 
programs succeeded in forestalling the emergence of retardation. We 
will briefly characterize the two studies approximating these conditions 
that have received the most scientific and media attention. 

The AbecedarIAN PROJECT. The Carolina Abecedarian Project started 
in the early 1970s, ander the guidance of Craig Ramey and his associ- 
ates, then at the University of North Carolina. 68 Through various so¬ 
cial agencies, they located pregnant women whose children would he at 
high risk for retardation. As the babies were bom, the ones with obvi- 
ous neurologic disorders were excluded from the study, but the remain- 
der were assigned to two groups, presumably randomly. In all, there were 
four cohorts of experimental and control children. Both groups of ba¬ 
bies and their families received a variety of medical and social work Ser¬ 
vices, but one group of babies (the “experimentals”) went into a day care 


program. The program started when the babies were just over a month 
old, and it provided care for six to eight hours a day, five days a week, 
fifty weeks a year, emphasizing cognitive enrichment activities with 
teacher-to-child ratios of one to three for infants and one to four to one 
to six in later years, until the children reached the age of 5. It also in- 
cluded enriched nutrition and medical attention until the infants were 
18 months old. 69 The Abecedarian Project is the apotheosis of the day 
care approach, This is extremely useful from a methodological perspec¬ 
tive: Even if the nation cannot afford to supply the same Services to the 
entire national population of children who qualified for the Abecedar¬ 
ian Project, it serves as a way of defining the outer limit of what day care 
can accomplish given the current state of the art. 

At the end of the fifth year, the children receiving the day care 
outscored those who did not by half a Standard deviation on an intelli- 
gence test. At last report, the children were 12 years old and were still 
doing better intellectually than the Controls. Combining all the cohorts, 
only 28 percent of the experimental children had repeated a grade, com- 
pared to 55 percent of the control children. Only 13 percent of the ex¬ 
perimental children had IQs of less than 85, compared to 44 percent of 
the control children. 70 

This would be unequivocal good news, except for charges that the 
two groups were not comparable in their intellectual prospects at birth. 
lgnoring the more technical issues, the major stumbling block to de¬ 
ciding what the Abecedarian Project has accomplished is that the ex¬ 
perimental children had already outscored the Controls on cognitive 
performance tests by at least as large a margin (in Standard score units) 
by the age of 1 or 2 years, and perhaps even by 6 months, as they had 
after nearly five years of intensive day care. t? 11 There are two main ex- 
planations for this anomaly. Perhaps the intervention had achieved all 
its effects in the first months or the first year of the project (which, if 
tme, would have important policy implications). Or perhaps the ex¬ 
perimental and control groups were different to begin with (the sample 
sizes for any of the experimental or control groups was no larger than 
fifteen and as small as nine, so random selection with such small num- 
bers gives no guarantee that the experimental and control groups will 
be equivalent). To make things still more uncertain, test scores for chil¬ 
dren younger than 3 years are poor predictors of later intelligence test 
scores, and test results for infants at the age of 3 or 6 months are ex- 
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tremely unreliable. It would therefore be dlfficult in any case to assess 
the random placement from early tesr scores. The debate over the re- 
sults is ongoing and unresolved as we write. 

THE MILWAUKEE Project. The Abecedarian Project was inspired hy an 
earlier attempt to forestall mental retardation in a population of 
children who were at high risk. The famous Milwaukee Project started 
in 1966 under the supervision of Richard Heber, a professor at the 
University of Wisconsin (Madison) who had been research director of 
President John F. Kennedys panel on mental retardation at the 
beginning of the decade. Healthy babies of poor black mothers with 
lQs below 75 were almost, but not quite, randomly assigned to no day 
care at all or day care starting at 3 months and continuing until rhey 
went to school. The day care lasted all day, five days a week, all year. 
The families of the babies selected for day care received a variety of 
additional Services and health care. The mothers were paid for 
participation, received training in parenting and job skills, and their 
other young children received free child care. Only thirty-five 
children are considered to have completed the study, seventeen 
receiving the special attention and the remainder serving as Controls. 

Soon after the Milwaukee project began, reports of enormous net 
gains in IQ (more than 25 points) started appearing in the populär me- 
dia and in psychology textbooks.'“ However, there was a dearth of pub- 
lication that allowed experts to evaluate the project. The few technical 
items that appeared raised more questions than they answered.'* 1t was 
not until 1988 that another Wisconsin professor associated with the 
work, Howard Garber, published an interpretable analysis of what had 
been done in the Milwaukee Project and what was found.*' 41 

By the age of 12 to 14 years, the children who had been in the pro- 
gram were scoring about. ten points higher in IQ than the Controls. Com- 
pared to other early interventions, this is a notably large differente. But 
this increase was not accompanied by increases in school performance 
compared to the control group. Experimental and control groups were 
both one to two years retarded in reading and math skills by the time 
they reached fourth grade; their academic averages and their achieve- 
ment scores were similar, and they were similarly rated by their teach- 
ers for academic competence. From such findings, psychologists Charles 
Locurto and Arthur Jensen have concluded that the programs substan- 
tial and enduring gain in IQ has been produced by coaching the chil¬ 


dren so well on taking intelligence tests that their scores no longer rnea- 
sure intelligence or g vcry well. ln| Time will teil whether a more hope- 
ful conclusion can be drawn. 

In summary, the two experiments contain soine promising leads. But 
it is not obvious where to go from here, for they differed in possibly im¬ 
portant ways. The Abecedarian Project evaluated day care; the Mil¬ 
waukee Project provided numerous interventions besides day care, 
including parental payment and training. Ir is hard to teil whether the 
formet tound enduring IQ benefits, given the very early divergence in 
tesr scores for experimental and control groups, but some academic ben¬ 
efits; the latter found an enduring IQ gain, but has not yet shown com- 
parable intellectual gains in school work. It may be relevant that the 
Abecedarian mothers had higher lQs than the Milwaukee mothers, so 
the children may not have been at equal risk for retardation. 

Reading this history of interventions, you may have noticed a curious 
parallelism: In the media, the good news is trumpeted as if there were 
no ambiguity; in the technical Journals, the good news is viewed with 
deep suspicion and discounted. Are the scholars as excessively nitpick- 
ing as the journalists are credulous? Here is the difficult-to-discuss prob- 
lem that overhangs the Interpretation of these results: The people who 
run these programs want them to succeed. This is hardly a criticism. 
People who are spending their lives trying to help disadvantaged chil¬ 
dren ought to be passionately committed to their success. But it is hard 
for them to turn around and be dispassionate about the question, “How 
well are we doing?” Often the raw data from these programs are not eas- 
ily aceessible to outsidc scholars. Not infrequently, when such data fi- 
nally are made available, they reveal a different and less positive way of 
viewing the successful results than the one that had previously been 
published. 

Consensus has thus heen hard ro reach, but progress is being made. 
ln our account, we have avoided dwelling on technical problems that, 
though perhaps valid, would modify the results only at the margin. 
When we have alluded to uncertainties and methodological difficulties, 
we have restricted ourselves to clear potential problems, which, if true, 
seriously weaken rhe Basis for claiming success. ln other words, we have 
tried to avoid nitpicking. The fact is that we and everyone eise are far 
from knowing whether, let alone how, any of these projects have in- 
creased intelligence. We write this pessimistic conclusion knowing how 
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many ostensibly successful projects will he cited as plain and indis- 
putable evidence that we are willfully refusing ro see the light. 

CHANGING THE ENVIRONMENT AT BIRTH 

There is one sure way to transform a child’s environment beneficially: 
adoption out of a bad environment into a good one. lf adoption oecurs 
at birth, it is at least possible that the potential effects of postnatal en¬ 
vironmental disadvantage could be wiped out altogether. 1 ' 61 The specific 
question now is: How many points does being raised in a good adoptive 
home add to an IQ score? 

Children are not put up for adoption for the editication ot social the- 
orists. There are no controlled experiments on the effects of adoption. 
Adoption usually means trouble in the biological fainily; trouble usu- 
ally lands on families nonrandomly and unaccountably, making h hard 
to extract clear, generalizable data. The most famous studies were mostly 
done decades ago, when the social and financial incentives for adoption 
were different from today’s. Legalized contraception and abortion, too, 


When Environment Is Decisive 


Lest anyone doubt rhar environment matters in the development of inrel- 
ligence, consider the rare and bizarre cases in which a child is hidden away 
in a locked room by a demented adult or breaks free of human contaet aL 
together and runs wild. From the even rarer cases that are invesrigated and 
told with care and accuracy, we know that if the isolation from human So¬ 
ciety lasts for years, rather than for just months, the children are intellec' 
tually stunted for life." Such was, for example, the experience of the “Wild 
Boy of Aveyron,” discovered in sourhem France soon after the Revolution 
and the establishment of the first French Republic, like an invitarion to 
confinn Rousseau s vision of rhe noble savage. The 12- or 11-year-old hoy 
had been found runningnaked in the woods, mute, wild, and evidently out 
of contact with humanity for most of his life. But, as it turned out, neither 
he, nor the others like him that we know about, resemhle Rousseau* no¬ 
ble savage in the least. Most of them never learn ro speak properly or to 
become independent adults. They rarely learn to meet even rhe lovvest 
Standards of personal hygiene or conduct. They seem unable to become 
fully human despite heroic efforts to restore them to society. From these 
rare cases we can draw a hopeful conclusion: If the ordinary human envi- 
ronment is so essential for bestowing human inrelligence, we shuuld he 
able to create extraordinary environments to raise it further. 1 '* 1 


have altered the pool of subjects for adoption studies. Both the envi¬ 
ronmental and genetic legacies of children put up for adoption have 
surely changed over the years, but it is impossible to know exactly in 
what ways and how much. In short, although data are abundant and we 
will draw some broad conclusions, this is an area in which solid esti- 
mares are unlikely to be found. 

As a group, adopted children do not score as high as the biological 
children of their adopting parents. 1791 The deficit may be as large as seven 
ro ten IQ points. It’s not completely clear what this deficit means. One 
hypothesis is that the adopted children’s genes hold them hack; another 
is that there is an intellectually depressing effect of adoption itself, or 
rhat being placed in adopting homes not immediately after birth (asonly 
some of them are), but only after several months or years, loses the ben¬ 
efit of the nurturing their adopting parents would have provided earlier 
in their lives. 

At the same time, researchers think it very Likely that adopted chil¬ 
dren eam higher scores than they would have had if they been raised by 
their biological parents, because the adopting home environment is 
likely to be better than the one their biological parents would have pro¬ 
vided. If so, this would he a genuine effect of the home environment. 
How large is the effect? Charles Locurto, reviewing the evidence and 
srriking an average, concludes that it is about six points. 80 As a Consen¬ 
sus figure, that seems about right to us. However, a consensus figure is 
not what we want, as Locurto recognizes. It does not identify how wide 
a gap separates the environments provided by adopting homes and the 
homes in which the children would have been reared had they not been 
adopted. We seek a comparison of the IQs of children growing up in 
homes of a known low socioeconomic Status and genetically compara- 
hle children reared in homes of a known high socioeconomic Status. 
What would the increment in IQ look like then? 

Two approximations to an ideal adoption study, albeit with very small 
samples, have recently been done in France. 81 In one, Michel Schiff and 
his colleagues searched French records for children abandoned in in- 
fancy, born to working-class (unskilled) parents, who were adopted into 
upper-class homes. Only thirty-two children met the study’s criteria. In 
childhood, their average IQ was 107. To understand what this means, 
two further comparisons are in order. First, the adopted children scored 
eight points lower on average than their schoolmates, presumably from 
comparable upper-class homes. This confirms the usual finding with 
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adopted children. But, second, they scored twelve points higher than 
twenty of their full or half-siblmgs who were reared at least for a time 
by a biological parent or grandparent in lower-c.lass surroundings. |lv| 
This study provides a rare chance to estimate roughly wherc the 
adoptees would have been had they remained in their original homes. 

A second French study compared four small groups of adopted chil¬ 
dren, reared in either high- or low-SES homes, and the biological off- 
spring of high- or low-SES parents. Thus one could ask, albeit with only 
a handful of children, 1 what happens when children horn to low-SES 
parents are adopted into a high-SES home or when children born to 
high-SES parents are adopted into low-SES homes; and so on. ln this 
study as well, the switch from low to high Status in the home environ- 
ment produced a twelve-point benefit in IQ. 1 * 41 Such findings, of course, 
implicate the home environment as a factor in the development of cog- 
nitive ability. We cannot be sure how much, because we do not know 
exactly how far down the SES ladder the children came from, or how 
far up the ladder they were moved into their adoptive homes. lf the 
twelve-point shift is produced by a small shift in environment (e.g., a 
child of a truck driver adopted by the family of a bank clerk), it gives a 
great deal of hope for the effects of adoption; if it was produced only by 
a huge shift in the environment (e.g., the child of a chronically unem- 
ployed illiterate adopted by the Rothschilds), not so much hope. In gen¬ 
eral, the more important the environment is inshapingcognitive ability, 
the larger the impact a given change in environment has on IQ. 

To see what the policy implications might be, let us suppose that low- 
and high-SES homes in the French studies represcnted the lOth and 
90th centiles in the quality of the home environment, respectively. If 
that were the case, what might be accomplished by moving children 
from very deprived homes (at the 2d centile, to make the example con- 
crete) to very advantaged ones (98th centile)?The resultsof the French 
study imply that such a shift in home environment would produce a ben¬ 
efit of almost twenty IQ points. 1iH:)1 

A swing of twenty points is considerable and seems to open up the 
possibility of large gains in intelligence to be had by equalizing homes 
“upward,” by appropriating for more families whatever nurturing things 
go on in the homes of the top 1 or 2 percent in socioeconomic Status. ls<>1 
The problem, obviously, is that no one knows how to equalize environ- 
ments upward on so grand a scale, particularly since so much of what 
goes on in the nurturing of children is associated with the Personality 


and behavior of the parent, not material wealth. This brings us to a va- 
riety of policy issues that it is now time to discuss more explicitly. 

A POLICY AGENDA 

Research 

Nothing is more predictable than that researc.he.rs will conclude that 
what is most needed is more research. In this case, however, the usually 
predictable is a little less so. 

Certain kinds of research are not needed. Next to nothing is to be 
learned about how to raise IQ hy more evaluations of He ad Start, or 
even hy replicating much better programs such as Perry Preschool or 
Abecedarian. The main lesson to be learned front these better programs 
has already been learned: It is tough to alter the environment for the 
development of general intellectual ability hy anything short of adop- 
rion ai birth. By now, researchers know enough to he confident that the 
next demonstration program is not going to he the magic. bullet, because 
they have already demonstrated beyond dispute that the ‘environment" 
is an unimaginably complex melange of influences and inputs for all the 
ehilcls waking hours (and perhaps some sleeping hours too). No mean- 
ingful proportion of that melange can reasonably be expected to he 
shaped by mry outside Intervention into the child s social environment, 
even one that lasts eight hours a day, using the repertoire of techniques 
now availahle. To have a large effect, we need new knowledge about 
cognitive development. 

New knowledge is likely to come from sharply focused investigations 
into the development of cognitive ability, conducted in an atmosphere 
that imposes no constraints on the researchers other than to seek and 
find useful knowledge within common ly accepted ethical constraints. 
The most promising leads may come from insights into the physiologi- 
calbasisof intelligence rather than from the cultural oreducational vari¬ 
ables that have been customary in educational research. Lung-term 
funding, buffers against hureaucratic mecldling, read iness to tund re¬ 
search on the hardest questions, if they are hrought forward by the in¬ 
ner logic of the Science, and not just the politically correct questions: 
This is what is needed, and what today’s research programs seldom pro- 
vide. With that set of caveats on the table, more research is indeed at 
the top of our policy agenda. Because intelligence is less than eompletely 
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Rentable, we can assume that, some day, it will be possible to raise the 
intelligence of children through environmental interventions. Bur new 
knowledge is required. Scientific research is the only way to get it. 

Nutrition 

Advocating that all children receive good nutrition does not come Lin¬ 
der the heading of daring new ideas. We advocate it nonetheless. Espe- 
cially if the inconsistent but suggestive results about the effects of 
vitamin and mineral Supplements on cognitive functioning are home 
out, it would be worth considering such Supplements as part of school 
and preschool lunch programs. 

Investment in Schooling 

When quantum changes are made in education—moving from no ed- 
ucation to an elementary education, or from 6 years of schooling to 12— 
then broad gains can occur, but the United States has in most respects 
passed this stage. Additional attempts to raise IQ through special ac- 
celerated courses have modest effects: short-term gains of two to four 
IQ points after extensive training. Long-term gains are less clear and 
likely to be smaller. In short, the school is not a promising place to try 
to raise intelligence or to reduce intellectual differences, given the con- 
straints on school budgets and the state of educational Science. 

General Purpose Preschool Programs 

Much is already known about what can be accomplished by ordinarily 
good preschool interventions—“ordinarily good” meaning that a few 
modest ly trained adults who enjoy being with children watch over a few 
dozen children in a pleasant atmosphere. It is hard to know how many 
Head Start programs reach this Standard. But a vast amount of research 
teils us that even ordinarily good Head Starts do not affect cognitive 
functioning much if at all. There is no reason to think that any realis- 
tically improved version of Head Start, with its thousands of centers and 
millions of participants, can add much to cognitive functioning. Even 
the claims for long-term benefits of Head Start on social behavior are 
unsubstantiated. 

Such findings do not invalidate Head Starts value as a few hours’ 
daily refuge for small children who need it. But the dehate over Head 
Start should move away from frivolous claims about how many dollars 


it will save in the long run, none of which Stands up to examination, 
and focus instead on the degree to which it is actually serving the laud- 
able and more fundamental function of rescuing small children from un- 
suitahle, joyless, and dangerous environments. 

Highly Targeted Preschool Programs 

The nation cannot conceivably implement a Milwaukee Project or 
Ahecedarian Project for all disadvantaged children. It is not just the dol- 
lar costs that put such ambitions out of reach (rhough they do) but the 
impossibiIity of Staffing them. With teacher-to-child ratios ranging as 
high as one to three and staff-to-child ratios even higher, tbese programs 
come close to calling for a trained person per eligible child. 

But should such programs be mounted for the extremes—the chil¬ 
dren lar out in the left-hand tail of home environments? We are not 
ralking about children who are just poor or just living in bad neighbor- 
hoods, but children who are at high risk of mental rctardation in an aw- 
ful environment, with parents who function at a very low cognitive 
level. Should such children he enrolled, within a few weeks of birth, in 
a full-time day care setting until they begin kindergarten? 

The decision cannot be justified pure ly on grounds of cognitive ben¬ 
efits, judging from what has come out ot the Milwaukee and Ahecedar- 
ian projects. On the other hand, the evidence about improvements in 
social ad just ment from the Perry Preschool Project may he relevant, if 
they stand up to further critical scrutiny. If they do, then highly inten¬ 
sive preschool programs have an important role to play in socializing 
children from highly disadvantaged backgrounds. Such results are not 
as hopeful as they are sometimes portrayed, but they may he substantial. 
Earlier, we said that the cost-benefit claims for Head Start could not 
withstand examination. For programs that achieve results comparable 
ro those claimed for Perry Preschool, perhaps they could. 18 ' 1 But even 
this limited endorsement is applicable only to the small fraction of the 
population that is both at substantial risk for mental retardation and Ev¬ 
ing in the worst conditions. Comparatively few children typically das- 
sified as “disadvantaged” fall in that category. 

Adoption 

Adoption at birth from bad environments into good environments 
raises cognitive functioning, especially in childhood and by amounts 
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that are not weil established. In general, the worse the Home that would 
have been provided by the biological parents and the better the adop- 
tive home, the greater is the cognitive benefit of adoption. Adoption at 
birth seems to produce positive non cognitive effects as well. In terms of 
govemment Budgets, adoption is cheap; the new parents Bear all the 
costs of twenty-four-hour-a-day care for eighteen years or so. The sup- 
ply of eager and qualified adoptive parents for infants is large, even for 
infants with special needs. 

If adoption is one of the only affordable and successful ways known 
to improve the life chances of disadvantaged children appreciably, why 
has it been so ignored in congressional debate and presidentiai propos- 
als? Why do current adoption practices make it so difficult for would- 
be parents and needy infants to match up? Why are cross-racial 
adoptions so often restricted or even banned? All these questions have 
political and social answers that would take us far outside our territory. 
But let it be said plainly: Anyone seeking an inexpensive way to do some 
good for an expandable number of the most disadvantaged infants 
should look ar adoption. 

The tough question about adoption involves the way the adoption 
decision is made. Governments should not be able to force parents to 
give up their children for any except the most compelling of reasons. 
Right now, the govermnent already has the power (varying hy state), 
based on evidence of neglect and abuse, which we do not advocate ex- 
panding. Instead, we want to retum to the state of affairs that prevailed 
until the 1960s, when children born to single women—where rnuch of 
the problem of child neglect and abuse originates—were more likely to 
be given up for adoption at birth. This was, in our view, a better siare 
of affairs than we have now, Some recommendations for turning back 
this particular clock are in Chapter 22. 

R ealism 

An inexpensive, reliable method of raising IQ is not available. The wish 
that it were is understandahle, and to pursue the development of such 
methods is worthwhile. But to think that the available repertoire of so¬ 
cial interventions can do the job if only the nation spends more money 
on them is illusory. No one yet knows how to raise low IQs substantially 
on a national level. We need to look elsewhere for solurions to the Prob¬ 
lems that the earlier chapters have described. 


Chapter 18 

The Leveling of American 
Education 

Most peoplc think that American public education is in terrible shape , and any 
number of allegations seem to confirm it. But a search of the data does not re - 
veal that the typical American school child in the past would have done any 
better on tests ofacademic skills. An American youth with average IQ is prob - 
ably better prepared academically now than ever before. The problem with 
American education is confined mainly to one group of students, the cogni - 
twely gifted. Among the most gifted students, SAT scores startedfalling in the 
mid-1960s, and the verbal scores have not recovered since. 

One reason is that disadvantaged students have been “in” and gifted .stu¬ 
dents “out” for thirty years. Even in the 199Os, only one-tenth of I percent 
of all the federal funds spent on elementary and secondary education go to 
programs for the gifted. Because success was measured in terms ofhow well 
the average and below-average children performed, American education was 
dumbed down: Textbooks were made easier, and requirements for courses , 
homeworh, and graduatiotx were relaxed. These measures may have worked 
as intended for the average and below-average students , but they let the gifted 
get away without ever devebping their potential. 

In thinkmg about policy, the first Step is to realixe where we are. ln a uni¬ 
versal education System, many students will fall short of basic academic com- 
petence. Most American parents say they are already satisfied with their local 
school. The average Student has little incentwe to work hard in high school. 
Oetting into most Colleges is easy, and achievement in high school does not 
pay off in higher wages or better jobs for those who do not go to College . On 
a brighter note, realism also bads one to expect that modest improvements in 
the education of average students will continue as they have throughout the 
Century except for the aberrational period from the mid-1960s to mid -1 970s . 

In trying to build on this natural improvement, the federal govemment 
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should Support greater flexibility for parents to send their children to schools 
of their choosing , whether through vouchers , tax credits , or choice within the 
public schools. Federal scholarships should reward academic performance. 
Some federal funds no<w so exclusively focused on the disadvantaged should 
be reallocated to progjams for the gifted. 

We urge primarily not a set of new laws but a change of heart within the 
ranks of educators. Until the latter half of this Century , it was taken for granted 
that one of the chief purposes of education was to educate the gifted —not be - 
cause they deserved it through their own merit but because, for better or worse , 
thefuture of society was so dependent on them. It wasfurther understood that 
this education must aim for more than technical facility. It must be an edu¬ 
cation thatfosters wisdom and virtue through the ideal of the u educated man. ” 
L ittle will change until educators once again embrace this aspect of their tLo¬ 
cation. 


T he education of the young is something that all human societies are 
committed to do. They can do it well or poorly. Many billions of 
dollars are already available for education m America. Can we spend 
them more wisely and produce better results? Our corner of the topic is 
how cognitive ability fits into the picture. 

It seemsself-evident: Education is what intelligence is mostobviously 
good for. One ideal of American education is to educate everyone to his 
or her potential. The students with the most capacity to ahsorb educa¬ 
tion should get the most of it—most in years, breadth, depth, and chal- 
lenge. But what should be self-evident is not. For thirty years, IQ has 
been out of fashion among American educators, and the idea that peo- 
ple with the most capacity to be educated should hecome the most ed- 
ucated sounds dangerously elitest. 

It needs to be said openly: The people who nin the United States— 
create its johs, expand its technologies, eure its sick, teach in its uni- 
versities, administer its cultural and political and legal institutions—are 
drawn mainly from a thin layer of cognitive ability at the top. (Re- 
member—j ust the top 1 percent of the American population consists of 
2.5 million people.) It matters enormously not just that the people in 
the top few centiles of ability get to College (almost all of them do, as 
we described in Chapter 1) or even that many of them go to elite Col¬ 
leges but that they are educated well One theme of this chapter is that 


since the 1960s, while a cognitive elite has become increasingly segre- 
gated from the rest of the country, the quality of the education they re- 
ceive has been degraded. They continue to win positions, money, 
prestige, and success in competition with their less gifted fellow citizens, 
but they are less well educated in the ways that make smart children 
into wise adults. 

Letting people develop to their füllest potential is not the only im¬ 
portant goal of public education. Since the founding of the republic, 
thoughrful Americans have recognized that an educated citizenry is vi¬ 
tal to its survival. This chapter therefore examines how well our coun¬ 
try fares in educating the average Student—not the one who is likely to 
occupy a place among the cognitive elite but the one most representa- 
tive of the typical American. We find that the average American young- 
ster is probably doing better on tests of academic skills than ever before. 
We will try to understand why a sense of crisis nevertheless surrounds 
American education despite this unexpected good news. 

We begin with quantitative evidence that shows the general outline 
of these trends and their connection to each other. Then we switch to 
observations of the kind that do not lend themselves to survey results 
or regression equations but that we believe to he justified by everyday 
experience in our schools and Colleges. 

TRENDS IN EDUCATION Is THE AVERAGE STUDENT 

A few years ago, the Wall Street Journal devoted its op-ed page to a re- 
production of an examination administered by Jersey City High School 
in 1885. 1 lt consisted of questions such as the following: 

Find the product of 3 + 4x + 5x 2 - 6x* and 4 - 5x - 6x\ 

Write a sentence containing a noun used as an attribute, a verb in 
the perfect tense potential mood, and a proper adjective. 

Name three events of 1777. Which was the most important and why? 

The fest was not for high school graduation (which would he impres- 
sive enough) but for admission to Jersey City High School. Fifteen-year- 
olds were supposed to know the answers to these questions. Of course, 
not many people went to high school in 1885. But could even the cream 
of the 15-year-olds in Jersey City s middle schools pass that exam today? 
It seems unlikely. 
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Bits of national memorabilia like this reinforce an Impression that is 
nearly universal in this country: American elementary and secondary 
education used to he better. The 1983 report by the Department of Ed- 
ucation, A Nation at Risk, said so most famously, concluding that “we 
have, in effect, been committing an act of unthinking, unilateral edu- 
cational disarmament.” 2 Its chairman concluded flatly that “for the first 
time in the history of our country, the educational skills of one Genera¬ 
tion will not surpass, will not equal, will not even approach, those of 
their parents.” 3 

We begin by affirming the conventional wisdom in one respect: The 
academic performance of the average American Student looks awful at 
first glance. Consider illireracy, for example. Some authorities claim 
that a third of the population is functionally illiterate. 4 No one really 
knows—when does “literacy” hegin?—hut no matter where the precise 
figure lies, the proportion is large. As of 1990, 16 percent of the 17-year- 
olds still in school were below the level called “intermediate” in the Na¬ 
tional Assessment of Educational Progress (NAEP) reading test—in 
effect, below the threshold for dealing with moderate ly comp lex writ- 
ten materialP Then one must consider thar more than 20 percent of 17' 
year-olds had already dropped out of school and were not part of the 
sample, 6 bringing us somewhere above 20 percent of the population who 
cannot use reading as a flexible tool of daily life. 

There is a profusion of horror stories in other subjects. Fewer than 
one in three American 17-year-olds in a nationally representative sam¬ 
ple could place the Civil War within the correct half-centuvy ot its ae- 
tual occurrence. 7 Fewer than 60 percent of American 17-year-olds could 
correctly answer the item, “A hockey team won five of its 20 games. 
What percent of the games did it win?” 8 More than 60 percent of adults 
in their early twenties cannot synthesize the main argurnent of a news- 
paper article. 9 Forty-four percent of adult Americans cannot understand 
“help wanted” ads well enough to match their qualifications with the 
job requirements. Twenty-two percent cannot address a letter well 
enough to make sure the post office can deliver it. 10 

Critics of American education also point to international compar- 
isons. Between the early 1960s and the end of the 1980s, six major in¬ 
ternational studies compared mathematical competence, science 
knowledge, or hoth, across countries. 11 ' 1 The National Center for Edu¬ 
cation Statistics has conveniently assembled all of the results for the 
first five studies in a series of twenty-two tables showing the United 


States’ ranking for each scale. The results for the industrialized coun¬ 
tries are easily summarized: In seven of the twenty-two tables, the 
United States is at the very hottom; in eight others, within two coun- 
tries of the bottom; in four of the remaining seven, in the bottom half. 1 ^ 
The most recent study, conducted in 1991, found that the United States 
continued to rank near the bottom on every test of every age group for 
the mach tests and near the middle on the science tests. 13 

International eomparisons need to be interpreted cautiously. |14J But 
the most common defense for Americas poor showing is losing credi- 
hility. For years, educators excused Americas performance as the price 
America pays for reraining such a high proportion of its students into 
high school. But Japan has had as high a retention rate for years, and 
recently many European nations, including some that continue to 
outscore us on the international tests, have caught up as wellA 

The picture is sure ly depressing. But as we look back to the idealized 
America of the earlier part of the Century, can we catch sight of Amer¬ 
ican school children who, on average, would have done any better on 
such measures than the youngsters of today? A growing number of ed- 
ucational researchers are arguing that the answer is no. 16 With qualifi¬ 
cations that the chapter will explain, we associate ourselves with their 
findings. Aceordmg to every longitudinal measure that we have been able to 
find , there is no evidence that the Irreparation of the average American youth 
is worse in the 1990s than it hm ever been. Considerable evidence suggests 
that, on the contrary, education for the average youth has improved 
steadily throughout the twentieth Century except for a period of decline 
in the late 1960s and early 1970s (which justified to some degree the 
alarming conclusions of the early 1980s) but front which the educational 
system has already fully recovered. How can we get away with these 
Statements that seem so contrary to what every orte knows? We dt) it by 
means of that innocuous word, “average.” 

Düring the first half of the twentieth Century, education for the average 
American young person improved steadily, partly hecause the average 
American young person spent more time in school than previously 
(Chapter 6). But much other evidence, marshaled convineingly by 
economist John Bishop, indicates a steady, long-term improvement in 
what Bishop calls “general intellectual achievement” that extended 
from the earliest data at the turn of the Century into the 1960s. 1 ' Even 
if we discount some of these results as reflections of the Flynn effect, 1!81 
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ir is impossible to interpret the data from 1900 to 1950 as showing any- 
thing other than some improvement. Then in the mkM960s began a 
period of deciine, as manifested most notably by the fall in SAT scores. 
Many people are under the impression that the deciine was deep and 
permanent for the entire population of students. ln reality, the deciine 
for the average Student was modest and recovery was quick. We know 
this first through the NAEP, begun in 1969, which we discussed with 
regard to ethnic. differences inChapter 13, lwl When the first NAEP tests 
were given, the SAT score deciine was in its fifth year and would com 
tinue for most of the next decade. The SAT is generally for a popula- 
tion concentrated at the upper end of the cognitive ability distribution, 
whereas the NAEP is for a nationally representative sample. While the 
scores for the population taking the SAT were still declining, the trend- 
lines of the NAEP results were flat. The differences between the earli- 
est NAEP scores in reading, Science, and math (which date from 1969 
to 1973, depending on the test) and the scores in 1990 are a matter of 
a few points and small fractions of a Standard deviation, and scores ob 
ten went up rather than down over that period . 1201 

SAT scores had started declining in 1964, but the NAEP goes back 
only to 1969, To reach back further for nationally representative data, 
we turn first to five almost completely unpublicized studies, known cob 
lectively as the national norm studies, conducted by the Educational 
Testing Service (ETS) in 1955, 1960, 1966, 1974, and 1983. ln these 
tests, a short Version of the SAT (the Preliminary Scholastic Aptitude 
Test, or PSAT) was administered to a nationally representative sample 
of American high school juniors. The results aresuminarized in the table 
below, adjusted so as to represent the mean score that all American ju- 


What SAT Score Deciine? The Results of the National 
Norm Studies, 1955-1983 


Year 

Verbal Mean 

Math Mean 

1955 

oo 

417 

1960 

374 

410 

1966 

383 

395 

1974 

368 

402 

1983 

376 

411 


So uro? .s: Goto 1955; Chandler and Schräder 1966; Katz and orhcrs 1970; 
Jackson and Schräder 1976; Braun, Ccntra, and King 1987. 


niors would have received on the SAT had they stayed in school for 
their senior years and had they taken the SAT. 

These results say that American eleventh graders as of 1983 were, as 
a whole, roughly as well prepared in both verhal and math skills as they 
had been when the collcge-bound SAT scores were ar their peak in 
1963, and noticeahly strenger in their verbal skills than they had been 
in the first norm study in 1955. The deciine in verhal scores between 
the 1966 and 1974 tests was 15 points—only about .14 Standard devia¬ 
tion. Ahout half of that had been recovered by the 1983 test. 1 " 1 ' 

A third source is the Iowa Test of Educational Development (ITED), 
a well-validared test, equated for stability front year to year, that has 
been administered to virtually a 100 percent sample of Iowa’s high 
school students for fifty years. What may one learn from rural, white 
Iowa? For examining trends in educational outcoincs over time, quite a 
hit. lowa’s sample of students provides socioeconomic variance—even 
Iowa has single-parent families and welfare recipients. Paradoxically, 
lowa’s atypical racial homogeneity (the population was more than 97 
percent non-Latino white throughout the period we are discussing) is 
an advantage for a longitudinal analysis by sidestepping the difficulties 
of analyzing trends for populations rhat are changing in their ethnic 
composirion. In examining Iowa’s test scores over time, we may nur he 
ahle to make judgments ahout how the education of minorities has 
changed hur we have a good view of what happened over the last sev- 
eral decades for the white population. 

Test scores for high school students in Iowa increased from the early 
1940s to the mid-1960s, dropped sharply from 1966 to 1978, but then 
rebounded, as shown in the figure below. We show the ninth-grade 
scores, which have been least affected by changes in dropout rates dur- 
ing the last fifty years. They show a steep rise through 1965 and an 
equally steep rise after 1977, reaching new heights from 1983 onward. 1 " 21 
The improvement has been substantial—on the order of half a Standard 
deviation since the mid-1970s, and about .2 Standard deviation ahove 
the previous high in 1965. The increase of 5.3 points from 1942 to 1992 
may he interpreted as approaching one Standard deviation. 

Evidence from other, independent sources is consistent with the story 
told hy the national norm studies and the Iowa data. Project TALENT, 
the huge study of high school students undertaken in 1960, readminis- 
tered its reading comprehension test in 1970 to another sample and 
found that a nationally representative sample of eleventh graders had 



424 Living Together 


A half-century of Iowa tests: Improvement as the norm, 
the slump as a twelve-year aberration 


Composite score of Iowa 9th-graders 

on the Iowa Test of Basic Skills 
16 - 



1942 1952 1962 1972 1982 1992 


Source: Iowa Testing Program, University of Iowa. 


gained slightly over its counterpart of I960, during the same decade that 
saw the steepest decline in the SAT. Other data on state tests in Vir¬ 
ginia, New York, Texas, and California, summarized by the Congres- 
sional Budget Office in its study of trends in educational achievemenr, 
cannot match the time ränge of the Iowa or SAT norm data, but, within 
their limits, they are generally consistent with the picture we have 
sketchedV Even the international assessments are consistent. The 
United States had some of its worst results in the first international as- 
sessment, conducted in the early to mid- 1960s when American SAT 
scores were near their peak. 24 Since then, the national American aver- 
ages have been, on balance, rising and the deficit in international com- 
parisons shrinking. 

Taken as a whole, the data from representative samples of high school 
students describe an American educational System that was prohably 
improving from the beginning of the Century into the mid-1960s, un- 
derwent a decline into the mid-1970s—steep or shallow, depending on 
the study—and rebounded thereafter. Conservatively, average high 
school students seem to be as well prepared in math and verbal skills as 
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they were in the 1950s. They may be betrer prepared than they have 
ever heen. If U.S. academic skills are deficient in comparison with other 
nations, they have heen comparativcly so for a long time and are prob- 
ahly hetrcr than they were. 

TRENDS IN EDUCATION II: COLLEGE STUDENTS 

Having questioned the widespread belief that: high school education To¬ 
day is worse on average than it useci to he, we now reverse course and 
oder some reasons for thinking that it has gotten worse for one specific 
group of students: the pool of youths in the top 10 to 20 pcrcent of the 
cognitivc ability distribution who are prime College material. To make 
this case, we will focus on the best-known educational trend, the de¬ 
cline in SAT scores. Visually, the story is told by whar must be the most 
frequently puhlished trendlines in American educational circles, as 
shown below. |2SI 

The steep drop from 1963 to 1980 is no minor Statistical fluctuation. 
Taken at face value, it teils of an extraordinarily large downward shift 
in academic aptitude— almost half a Standard deviation on the Verbal, 
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almost a third of a Standard deviation on the Math. [2<l1 And yer we have 
just finished demonstrating that this large change is not reflected in the 
aggregate national data for high school students. Which students, then, 
account for the SAT decline? We try to answer that question in the next 
few paragraphs, as we work our way through the most common expla- 
nation of the decline. To anticipate our conclusion, the Standard ex- 
planation does not stand up to the data. We are left with compelling 
evidence of a genuine decline in the intellectual resources of our bright- 
est youngsters. 

The most familiär explanation of the great decline is that the SAT 
was “democratized” during the 1960s and 1970s. The pool of people tak- 
ing the test expanded dramatically, it is said, bringing in students from 
disadvantaged Backgrounds who never used to consider going to Col¬ 
lege. This was a good thing, people agree, but it also meanr that test 
scores went down—a natural consequence of breaking down the old 
elites. The real problem is not falling SAT scores but the inferior edu- 
cation for the disadvantaged that leads them to have lower test scores, 
according to the Standard account.“ ; 

This common view is mistaken. To make this case requires delving 
into the details of the SAT and its population. “ s To summarize a complex 
story: During the 1950s and into the early 1960s, the SAT pool expanded 
dramatically, but scores remained steady. ln the mid-1960s, scores 
started to decline, but, by then, many state universities had become less 
selective in their admissions process, often dropping the requirement 
that students take SATs, and, as a result, many of the students in the mid¬ 
dle level of the pool who formerly took the SAT stopped doing so. Fo- 
cusing on the whites raking the SAT (thereby putting aside the effects 
of the changing ethnic composition of the pool), we find that throughout 
most of the white SAT score decline } the white SAT pool was shrinking, not 
expanding. We surmise that the white population of test takers during 
this period was probably getting more exclusive socioeconomically, not 
less. It is virtually impossible that it was becoming more democratized in 
any socioeconomic sense. 

After 1976, when detailed background data on white test takers be¬ 
come available, the evidence is quite explicit. Although the size of the 
pool once again began to expand during the 1980s, neither parental in- 
come nor parental education of the white test takers changed. 12gl After 
factoring in the effects of changes in the gender of the pool and changes 
in the difficulty of the SAT, we conclude that the aggregate real decline 


irom 1963 to 1976 among whites taking the SAT was on the order of 
thirty-four to forty-iour points on the Verbal and fifreen to twenty-five 
points on the Math. From 1976 to 1993, the real white losses were no 
more than a few additional points on the Verbal. On the Math, white 
scores improved about three or four points in real terms alter changes 
in the pool are taken into account. Or in other words, when everything 
is considered, there is reason to conclude that the size ol the drop in the 
SAT as shown in that familiär, unsophisticated graphic with which we 
opened the discussion is for practical purposes the same size and shape 
as the real change in the academic preparation of white college-bound 
SAT test takers. Neither race, dass, parental education, composition of 
the pool, nor gender can explain this decline of forty-odd points on the 
Verbal score and twenty-odd points on the Math for the white SAT-tak- 
ing population during the 1960s and 1970s. For whatever reasons, dur¬ 
ing the 1960s America stopped doing as well intellectual ly by the core 
of students who go to College. 

Rather than democratization, the decline was more probably due to 
levelingdown, or mediocritization: adovvnward trendof the educational 
skills of America’s academically most promising youngsters toward those 
of the average Student. The net drop in verbal skills was especially large, 
mach larger than net drop in math skills. It affected even those students 
with the highest levels of cognitive ability. 

Does this drop represent a fall in realized intelligente as well as a drop 
in the quality of academic training? We assume that it does to some ex¬ 
tern but are unwilling to try to estimate how much of which. The SAT 
score decline does underscore a frustrating, perverse reality: However 
hard it may be to raise IQ among the less talented with discrete inter- 
ventions, as described in Chapter 17, it may he within the capability of 
an educational System—probably with the complicity of broader social 
trends—toput aceilingon,oractually dampen, the realized intelligence 
of those with high potential/ 0 

TRENDS IN EDUCATION III: THE BRIGHTEST OF THE 
BRIGHTEST 

One more piece of the puzzle needs to be put in place. The SAT popu¬ 
lation constitutes a sort of broad elite, encompassing but not limited to 
the upper quartile of the annual national pool of cognitive ability. What 
has been happening to the scores of the narrow elite, the most gifted 
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students—roughly, those with combincd scores of 1400 and morc—who 
are most likely to fill the nation’s best graduate and Professional schools? 
They have gone down in the Verbal test and up in the Math. 

The case for a drop in the Verbal scores among the brightest can be 
made without subtle analysis. In 1972, 17,560 college-hound seniors 
scored 700 or higher on the SAT-Verbal. In 1993, only 10,407 scored 
700 or higher on the Verbal—a drop of 41 percent in the raw number 
of students scoring 700 and over, despite the larger raw number of stu¬ 
dents taking the test in 1993 compared ro 1972. n Dilution of the pool 
(even if it were as real as legend has it) could not account for smaller 
raw numbers of high-scoring students. But we may make the case more 
systematically. 

The higher the ability level, the higher the proportion of students 
who take the SAT At the 700 level and beyond, the proportion ap- 
proaches 100 percent and has probably been so since the early 1960s 
(see ühapter 1). That is, almost all 17"yeanolds who would score above 
700 if they took the SAT do in fact take the SAT at some point in their 
high school career, either because of their own ambitions, their parents’, 
or the urging of their teachers and guidance counselors. It is therefore 
possible to think about the students who score in the 700s on the SAT 
as a proportion of all 17-year'olds, not just as a proportion of the SAT 
pool. We cannot carry the story back further than 1967 but the results 
are nonetheless provocative, as shown in the next figure. 1 * 21 

The good news is that the mathematics score of the top echelon of 
American students has risen steeply since hitting its low point in 1981. 
(Diven all the attention devoted to problems in American education, 
this finding is worth lingering over for a moment. In a period of just 
twelve years, front 1981 to 1993, the proportion of 17-year-olds scoring 
over 700 on the SAT-Math test increased by 143 percent. This dramatic 
improvement during the 1980s is not explainable by any artifact that 
we can identify, such as having easier Math SAT questions. 1 ^ 1 Nor is it 
due to the superior math performance of Asian-American students and 
their increase as a proportion of the SAT population. Asian-Americans 
are still such a small minority (only 8 percent of test takers in 1992) that 
their accomplishments cannot account for much of the national im- 
provernent. The upward bounce in the Math SAT from 1981 through 
1992 was a robust 104 percent among whites. [H * 

Now let us turn to the less happy story about the SAT'Verbal. The 
proportion of students attaining 700 or higher on the SAT feil sharply 


Among the most gifted students, there is good 
news about math, bad news about verbal 


700+ scorers, as a percentage of 17-year-olds 



SourcL*: The (College Board. 

from 1967 to the mid-1970s. Furthermore, SAT scores as of 1967 had 
been dropping for four years before that, so we Start from a Situation in 
which the verbal skills of Americas most gifted students dropped pre- 
cipitously from the early 1960s to the early 1970s. Unlike the Math 
scores, however, the Verbal scores did not rehound significantly. Nor 
may one take much comfort from the comparatively shailow slope of 
the decline as it is depicted in the figure. The proportional size of the 
drop was large, from about eight students per 1,000 17-ycar-olds in 1967 
to three per 1,000 in 1993, a drop of about 60 percent. lbi The other ma- 
jor source of data about highly talented students, the Graduate Record 
Examination, parallels the story for the students scoring 700 or above 
on the SAT. 1 * 1 

AN EXPLANATION: DUMBING DOWN 

How might these disparate and sometimes contradictory trends be tied 
together? 

One important part of the story begins with the 1950s. Why didn’t: 
the scores fall, though the proportion of students taking the SAT went 
from a few percent to almost a third of the high school population in 
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little more than a decade? The answer is that the growing numbers of 
SAT takers were not students with progressively lower levels of acade- 
mic ability but able students who formerly did not go on to College or 
went to the state university (and didn’t take the SAT) and now were 
broadening their horizons. This was the post-World War II era that we 
described in Chapter 1, when educational meritocracy was on the rise. 
As the path to the better Colleges began to open for youngsters outside 
the traditional socioeconomic elites, the population of test takers grew 
explosively. Düring this period, we can safely assume that the pool 
opened up to new socioeconomic groups, but it occurred with no dilu- 
tion of the pool’s academic potential, because the reservoir of academic 
ability was deep. Then, as the 1950s ended, another factor worked to 
sustain performance: From the Sputnik scare in 1957 through the early 
1960s, American education was gripped by a get-tough reform move¬ 
ment in which math and the Sciences were emphasized and high schools 
were raising Standards. Education for the College bound probably im- 
proved during this period. 

Softened Standards 

Then came the mid- 1960s and a decade of decline. What happened to 
education during this period has been described by many observers, and 
we will not recount it here in detail or place blame. [37] The simple and 
no longer controversial truth is that educational Standards declined, 
along with other momentous changes in American society during that 
decade. 

The educational change is epitomized by the title for this section. 
“Dumbing down” has become a term of art for the process by which the 
vocabulary in a textbook is deliberately simplified. We use it in a broader 
sense. One of the chief effects of the educational reforms of the 1960s 
was to dumb down elementary and secondary education as a whole, 
making just about everything easier for the average Student and easing 
the demands on the gifted Student. 

The dumbing down of textbooks permeated the textbook market, as 
publishers and authors strove to satisfy school boards, which routinely 
applied “readability” formulas to the books they were considering. 38 
Thomas Sowell has described a typical example of this process, in which 
the words spectacle and admired were deleted from a textbook because 
they were deemed too difficult for high school students. Sowell com- 
pares such timidity to the McGuffey’s Readers , the staple text of nine- 


teenth-century children in one-room schoolhouses, pointing out that 
the Third Reader used words such as species, dialogue, heath , and be- 
nighted —intended for 8-year-olds. 39 

Dumbing down also occurred in the high schooFs College track. More 
electives were permitted, and the requirements for credits in Science, 
mathematics, and literature were relaxed. There were exceptions, such 
as the high-quality Advanced Placement courses offered in a minority 
of high schools, taken by about 1 percent of American students. 40 But 
the broader result was that the number of courses in the core disciplines 
declined. Educational specialists agree that grades inflated—it took less 
work, and less homework, to earn good grades 41 —and that less home- 
work was done. [42] 

In this context, it comes as no surprise that SAT scores declined even 
among the diminishing proportion of high school seniors who took the 
SAT during the last half of the 1960s. Indeed, it was not just students 
who took the SAT who suffered during that period. For a time, educa¬ 
tional preparation got worse for everyone, as reflected in the Iowa data 
and the SAT national norm studies, not just for the college-bound 
tracks. But why was the size of the drop smaller and the rebound quicker 
and more complete for the population as a whole than for the SAT pop¬ 
ulation? And why, in the SAT population, do we observe such a large 
difference between Math, where decline was small and the recovery sub- 
stantial, and Verbal, where the decline was large with no apparent re¬ 
covery at all? Why were these contradictory trends most pronounced 
for the most gifted students? 

Competing Agendas 

Our explanation is consistent with the facts as we understand them, but 
we should emphasize that our explanation is interpretive as well. It goes 
like this: 

Since the late 1970s, the public dissatisfaction about the state of 
American elementary and secondary education has produced some 
changes. From 1982 to 1987, for example, the proportion of high school 
graduates who completed a solid program of four years of English, three 
of social Sciences, three of the hard Sciences, and three of math more 
than doubled. 43 The average course loads in all the academic areas went 
up, most dramatically in foreign languages but with sizable gains in Sci¬ 
ence and math as well. [44] Many people wanted higher Standards in their 
schools, and the schools tried to respond. 
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But other pressures were (and are) put on the schon Ls, and they 
created a gulf between what happened to courses in mathematics and 
tocourses in every other academic field. Ifaschool, trying to have higher 
Standards in math, began to require a basic calc.ulus course for its 
College prep students, there were limits to the amount ot fudging that 
could be done with the course content. Somehow a core of analytic 
techniques in calculus had to be part of the course. There was no way 
around it. Furthermore, there is a well-established Standard for decid- 
ing whether calculus has been learned: Can the Student solve calculus 
Problems? 

Another feature ot math skills at the high school level is that they can 
he increased independent of the students development in other inteb 
lectual skills. A Student may learn to manipulate quadratic equations 
even if he is given not a gl immer of how formal logic mighr re late to ex- 
pository prose or to the use of evidence in civics dass. It is good that math 
scores have risen, but it remains true that raising math Standards can he 
routinized in ways that cannot be applied to the rest of the Curriculum. 

How, for example, does one decide that the Standards for an English 
literature course have been “raised”? In the old days, it wouldn’t have 
been seen as a difficult question. Standards would be raised if the stu¬ 
dents were required to read a larger number of the Great Books (no one 
would have had much quarre 1 about what they were) or if students were 
required to write longer term papers, subject to stricter grading on ar- 
gumentation and documentation. But since the late 1960s, such 
struightforward ways of looking at Standards in the humanities, social 
Sciences, and even the physical Sciences were corrupted, in the sense 
that ehe Standards of each discipline were subordinated to orher con- 
siderations. Chief among these other considerations were multicultur- 
alism in the curriculum, the need to minimize racial differences in 
performance measures, and enthusiasm for fostering self-esteem inde¬ 
pendent of performance.We assume that a politically compromised 
curriculum is less likely to sharpen the verbal skills of students than one 
that hews to Standards of intellectual rigor and quality. We make these 
observations without belittling the issues that have been at center stage 
in American secondary education. But if the question is why the down- 
hill slide in verbal skills has not reversed, here is onc possible explana- 
tion: The agendas that have had the most influence on curricula are 
generally antagonistic to traditional criteria of rigor and excellence. 

These influencescome together when textbooks are selected by large 


school systems. A school board runs no risk whatsoever of angry histo- 
rians picketing their offices. They run grave risks of pickets (and of be- 
ing voted out of office) if a textbook offends one of the many interest 
groups that scrutinize possible choices. Publishers know the market and 
take Steps to make sure that their products will selb 

There are doubtless other culprits that help explain the difference 
between the recovery in math scores and the failure to recover in ver¬ 
bal scores. Television, rather than the printed page, became the primary 
medium for getting news and recreation at home after mid-Century, and 
that process was also reaching full flower in the 1960s. Telephones dis- 
placed letter writing as the medium for long-range communication. 
Such trends are hostile to traditional definitions of excellence in ver¬ 
bal skills. The simple hypothesis of this story is that these pressures ex- 
isted across the curriculum and in society at large but that math skills 
were less susceptible to them. (Math skills may instead have been get¬ 
ting a boost from the accessibility of Computers, calculators, and other 
high-tech gadgetry.) When parents demanded higher Standards, their 
sc.hools introduced higher Standards in the math curriculum that really 
were higher, and higher Standards in the humanities and social Sciences 
that really were not. 

The same dynamics provide a hypothesis for explaining why the re- 
bound was more complete for the nations overall Student population 
than for the SAT population. A textbook that is dumbed down is in fact 
helpful to the mediocre Student. A recent study of six textbooks over a 
twelve-year period demonstrated that they had indeed heen simplified, 
and students performed significantly better on the current, dumbed- 
down textsT Subjects that were traditionally not included in the cur¬ 
riculum for the lower end of the distribution—for example, exposure to 
serious literature—have now been so simplified as to be accessible to al¬ 
most all, 

The same durnbed-down textbook can quite easily have a depressing 
effect on the talented students development. And while the textbooks 
were being simplified, subjects that would push the best students to their 
limits, such as the classical languages, were all but dropped. Offered this 
diluted curriculum, talented students do not necessarily take the initia¬ 
tive to Stretch themselves. Plenty of students with high IQs will happily 
choose to write about The Hobbit instead of Pride and Prejudice for their 
term paper if that Option is given to them. Few of even the most bril- 
liant youngsters tackle the Aeneid on their own. 
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The Neglect of the Gifted 

Another factor in the declining capabilities of America’s brightest sru- 
dents is that the decline occurred when, in policy circles, disadvantaged 
students were “in” and gifted students were “out.” When the first sig- 
nificant aid went to secondary educarion at the end of the Eisenhower 
years, it was for the brightest students who might become scientists or 
engineers. In 1965, with the passage of the Elementary and Secondary 
Education Act of 1965 (ESEA), the funding priority turned ISOdegrees, 
and ithas remained anchored in the new position eversince. Asof 1993, 
the ESEA authorized forty-six programs with budgets that added up to 
$8.6 billion. Most of these programs are specifically designated for stu¬ 
dents in low-income areas and students with special educational needs. 
Even the programs that might apply to any sort of Student (improve- 
ments in Science and mathematics education, for example) offen are 
worded in ways that give preference to students from low-income areas. 
Another set of programs are for Support Services. And, finally, there are 
programs designated for the gifted and talented. This is the way that the 


$8.6 billion budget broke out for fiscal 199.3 

Programs for the disadvantaged 92.2% 

Programs that might benefit any Student 5.6% 

Support and administration of ESEA programs 2.1 % 

Programs for the gifted 0.1% 


This hreakdown omits other federal programs with large budgets aimed 
at the education of the disadvantaged—more than $2 billion for Head 
Start (funded by the Department of Health and Human Services, not 
the Department of Education), more than $3 billion for job rraining 
programs, plus a Scattering of others. 40 

Theoretically, programs targeted at disadvantaged students could also 
he programs for the cognitively gifted among the socioeconomically dis¬ 
advantaged. But that’s not the way it has worked. Disadvantaged as used 
by three decades of administrators and school boards using ESEA funds 
has consistently meant not just students who are poor or living in an in- 
ner-city neighborhood but students who exhibit learning problems. Pro¬ 
grams for the intellectually gifted but otherwise disadvantaged attract 
little support and, occasionally, hostility. A case in point is Banneker 
High School in Washington, D.C., a special academic high school in 


the middle of the black northeast section of the city, estahlished by a 
former Superintendent of schools with the school boards reluctant per- 
mission in 1981. 

The establishment of Banneker High followed a proud tradition in 
Washington, where once-elite Dunbar High had turned out many of the 
narions black leaders. But throughout the 1980s, Banneker was under- 
funded and repeatedly threatened with closure. Banneker was “elitest,” 
said an influential school board member, a luxury for parents who “had 
their children in private school and can no longer afford it and bring 
them back to essentially a private school at the public expense.” 49 Ban- 
neker’s “elitest” admissions policy? Applicanrs had to write an essay, be 
interviewed, be in the top 18 percent of their dass, and read and com- 
pute at grade level —a broad conception of “elitist” indeed. Throughout 
it all, teachers competed to teach at Banneker and students competed 
to artend. Banneker placed large proporrions of its graduates in College 
and had no significant problems with discipline, drugs, crime, or the 
other ills of Contemporary urban schools. v And yet, as we write, Ban¬ 
neker continues to be barely tolerated by the school System. Banneker s 
story has numerous counterpart.s in other urban centers. Funds for the 
economically and socially disadvantaged have meant, for practical pur- 
poses, funds concentrated on the cognitively disadvantaged as well. 

A POLICY AGENDA 

What are the implications for policy? The pros and cons of the specific 
reforms on the table—national achievement tests, national curricula, 
school choice, vouchers, tuition tax credits, apprenticeship programs, 
restoration of the neighborhood school, minimum competency tests, 
ability grouping, and a host of others—involve nuts-and-bolts issues 
that are better argued out in detail, on their merits, in works that are 
specifically devoted to them. We also leave for other settings a discus- 
sion of the enormous potential of new technologies, from the personal 
Computer to laser disks to the information superhighway, to enrich and 
hroaden educational resources. Here we concentrate on certain Strate¬ 
gie implications about educational reform that flow from our account— 
first, regarding attempts to Upgrade American education as a whole, and 
then regarding the education of the gifted. 
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Recdism About the Limits of General Improvements in Education 

We hegin with the first and most widely accepted conclusion: The ex¬ 
tern and quality of iearning for American students in general is low— 
lower than in most other industrialized countries but also (it would 
seem) low by basic Standards of what a person of ordinary ability ought 
to leam. Before jumping into any particular set of Solutions, however, 
policymakers need to he more realistic ahout what can he done to im- 
prove the education of students in a heterogeneous, nontotalitarian 
country. Specifically, critics of American education must come to terms 
with the reality that m a universal education System, many students will not 
reach the level of education that most people view as basic. Consider again 
the example of functional illiteracy mentioned earlier: that over 20 per- 
cent of 17-year-oLds are below the intermedtate reading level on the 
NAEP, meaning that they are marginal readers or worse. This is usually 
considered a failure of American education, and perhaps it is. Bur most 
of these nonreaders come from the bottom of the cognitive ability dis« 
tribution. How well should they he ahle to read after a proper educat ion, 
given the economic, technological, and political constraints on any Sys¬ 
tem of mass education? 

The United States has not yet completed the first half-century of 
human history in which universal secondary education became a goal. 
It was not until 1963 that the dropout rate feil below 30 percent of all 
17-year-olds. Already we have seen improving performance in acade- 
mic tests for the average Student as educational opportunities have 
spread across the population. At about the same time, educators—and 
educational critics—stopped thinking hard or openly about Variation 
in intellectual abilities. It is time to reopen the issue. What constitut.es 
educational success for persons at various points along the cognitive 
ability distrihution ? The aspirations of educational reformers should 
be accompanied by a realistic and systematic assessment of where 
the room for improvemenr lies, taking the cognitive distribution into 
account. 

Some critics blame students who do not work hard enough, rather 
than schools that tail to teach, for the shortcomings of American edu- 
cation. One hears repeatedly about students as couch potatoes. The av¬ 
erage American Student, it is said, takes the easy way out compared not 
only to the fabled Japanese but to children in countries such as Norway, 
the Netherlands, lreland, and ItalyT The obvious policy implication is 


to do something to make students work harder. Lengthen the school 
year. Lengthen the school day. Require homework every night. Toughen 
the gradingJ 521 The proposals fill the air. We think many of them are 
good ideas. But the closer one looks at the reasons why students do not 
work harder, the less it seems that they are to blame. 

First, most American parents do not <want drastic increases in the acade - 
rme teork load. Some of the evidence for this lies in quantitative survey 
data. In Harold Stevenson’s landmark cross-national study of Chinese, 
Japanese, and American education, 91 percent of American parents said 
their school is doing an “excellent or good job,” compared to only half 
that proportion of Taiwanese or Japanese parents."* 1 It has become a tru- 
ism in survey research: Americans teil Interviewers that American ed¬ 
ucation in general is going to the dogs, then in the next breath give high 
marks to their children’sown school. 154 * In surveys, many American par¬ 
ents are either apathetic about school or hostile tovvard more homework 
and tougher gradingT In this climate, more demanding Standards can- 
not easily he rmposed from ahove. 

But if you live near a public school, you need not search the techni- 
cal Journals to verify the point. Visit the school and talk to any teacher 
ahout the last half-dozen parents who have complained to him. For 
every parent who visits the principal to teil him that Johnny isn v t get- 
ting enough homework are several who visit to complain Johnny is be- 
ing overworked. Parents who are upset ahout inflated gradles seldom 
make a teacher’s life miserable. Parents who are upset about their childs 
low grade do. 

Parents do want orderly classrooms, no weapons, no violence, no 
drugs, and other safeguards for their children that many schools, espe- 
cially in large ciries, no longer provide. These urgent needs are fueling 
much of the shift into private schools and political backing for the 
“school choice” movement. But the average parent seems unprepared 
to support genuinely Stifter academic Standards. 

A second point is that the average American Student has little incentive 
to work harder than he already does in high school. Economist John Bishop 
has taken the lead in making this case, emphasizing two points.' 5<) Bishop 
first ohserves that a demanding high school curriculum is not necessary 
for admission to most Colleges. For most college-bound students, find- 
ing the money is harder than anvassing the necessary high school record. 
And it’s their parents who typically need to find the money. Why bother 
to take tough courses? This is true even of talented students applying to 
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selective schools; only a handful of schools at the suininit routinely turn 
away students with SATs in the 1200s and up (see Chapter 1). A Stu¬ 
dent who tests reasonably well (he knows rhis by the time he gets to 
high school) and doesn’t have his sights set on the likes of Yale does not 
have to be too careful about which courses to take as long as his grades 
are decent. Only youngsters who aspire to Colleges that usually take stu¬ 
dents with higher scores than their own have a strong incentive to study 
hard—and however common this Situation may seem at the school at- 
tended by the children of most of our readers, it descrihes a minuscule 
Proportion of the national high school population. 

Bishop also shows that achievement in high school does not pay off 
in higher wages or better jobs. Many employers assume that the high 
school diploma no longer means much more than that the Student 
warmed a seat for twelve years. Others are willing to look at high school 
transcripts as part of the hiring process, but though schools are legal ly 
obligated to respond to requests for transcripts, hardly any transcripts 
ever re ach the employer, and thosc that do usually arrive so late that 
they are useless. h7] Using the NLSY, Bishop found that better fest scores 
in Science, language arts, and math were associated with lower wages 
and employment among young men in the first ten years after high 
school. Students, like everybody eise, respond to what’s in it for rhem. 
Theres close to nothing in it for them in working hard in high school. 
Ergo, they do not work hard in high school. 

How might policy changes rcconnect high school performance with 
payoffs alter graduation? For students not conrinuing to College, Bishop 
recommends a variety of measures to certify competencies, to make trän- 
Scripts understandable and availahle to employers, and to hui Id up data 
hanks, national or regional (private, not federal), to enahle youths to 
send their “competency profile” to potential employers.™ 

Such programs may work il employers oi high school graduates had 
a shortage of competent workers applying for jobs. Some pilot projects 
are underway that should teil how much such data banks are needed and 
used. c But in thinking about linking up performance in high school 
with the job market, here is a dose of realism: When it comes to pre- 
dicting job productivity in most common jobs, an employer who rou¬ 
tinely trains new employees in specific job skills anyway hasn’t much 
reason to care about whether the applicant got an A or a C in high 
school English or, for that matter, how well the applicant did in high 
school vocational courses, except perhaps as a rough measure of how 


bright and conscientious the applicant is. On the average, and assum- 
ing no legal restrictions on testing, an employer can get a better idea of 
how well a job applicant will perform in job training by giving him an 
inexpensive twelve-minute intelligence test than by anything that the 
high school can teil the employer about the applicants academic 
record. 1611 This puts sharp limits on how interested employers will be 
high school performance. 

As far as Colleges are concerned, what incentive do they have to raise 
admissions requirements if it means fewer students? Düring and just af¬ 
ter the baby boom years, private Colleges added many students to their 
rosters and now face an oversupply of places for a shrinking market. Few 
prefer to go out of business rather than take students with modest cre- 
dentials. Public universities make their admissions policies in response 
to political pressures that generally push them toward more inclusive- 
ness, not less. When neither buyer nor seller profits from higher Stan¬ 
dards, why would Standards rise? 

Realism About How Federal Reforms Will Work in the American 
Context 

ln ways that few people want to acknowledge, America does not want 
its schools to take a large leap in whar they demand of youngsters. Our 
conclusion is that if parents, students, and employers do not broadly Sup¬ 
port a significantly more demanding educational System, it s not going 
to happen. Nonetheless, a variety of sensible reforms arc on the table— 
more homework, a longer school year, and the like. Why don’t we at 
least recommend that the federal government mandate these good 
things? On this question, the experience of the 1960s and 1970s serves 
as an object lesson for today. 

Educarional reformers in the 1960s and 1970s were confident that 
their ideas were good things to do. They were impatient with the con- 
servatism of local school districts. They turned to a responsive White 
House, Congress, and Supreme Court, achieved many of their objec- 
tives, and thereby contributed to a historic shift in American education. 
On balance, the turn was for the worse as far as academic excellence 
was concerned, but that doesn’t mean the ideas were had in rhemselves. 
Ideas such as more racial Integration in the schools, more attention to 
the needs of disadvantaged students, and more equitable treatment of 
students in disciplinary matters do not seem less obviously “good” to us 
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than ideals such as more homework and a longer school year. It. was not 
the Core ideas that were at fault (in most instances) hut some Basic Prob¬ 
lems that go wirb refnrming American education at a national level. 

We characterize the Situation as follows: Slow improvement seems to 
have heen a natural part ot twentieth-century American education uiv 
til the 1960s. This slow improvement had great inertia, in the sense that 
a slow-moving freight train has inertia. It is very difficult for an outside 
force to accelerate the freight train hut comparatively easy for an out¬ 
side force to derail it. ln the United States, the federal government tends 
to be an outside force, more offen derailing than pushing along, for rea- 
sons that are peculiarly American. 

In countries such as France and Germany, wirh more homogeneous 
populations and more authoritarian and unapologetically elitest educ.a- 
tional traclitions, the national government can get away with central- 
ized school Systems that educate their brightest youth well. In the 
United States, it cannot. Federal Standards, federal rules, and federal 
curricula, were they to be established, would inevitably be watered down 
and educational goals would be compromised with social and political 
ones. The federal government responds to pushes from all sides and gets 
equally nervous about aftirming the genius of either Huck Finn or 
Charles Darwin. Powerful teachers' organizations will not tolerate cer- 
lificarion tests that flunk large numbers of teachers. Organizations that 
represent minority groups will not tolerate national educational Stan¬ 
dards rhat cause large numbers of minority children to flunk. These are 
political facts of life that will not change soon, no matter who is in the 
White House. 

With Americas immense diversity and its tradition of local control, 
Washington is the wrong place to look for either energy or wisdom on 
educational reform. In our view, any natural impulse toward educational 
improvement will be best nourished by tetting the internal forces—the 
morivations of parents for their children and teachers for a satisfying ca- 
reer—have their head. We will state our recommendation in hroad 
terms: 

The federal goternmem should actively support programs that enahle all 
parents, not just affluent ones , to choose the school that their children attend. 
Current movements to provide inereased parental ehoice in schools are 
a hopeful sign, whether it he ehoice within the public school System, 
vouehers, or tuition tax credirs. Wirhout heing any more specific than 
that, vvc urge rhat inereased parental ehoice extend to private as well as 


public schools, and to religious private schools as well as secular ones. 

Will inereased parental ehoice help, given the modest academic goals 
that many parents have for their children? There are reasons for think- 
ing it will. First, the learning that goes on in a school depends on the 
school environment as well as on its curriculum. Here, the great ma- 
jority of parents and teachers stand on common ground. Orderly dass- 
rooms and well-enforced codes of behavior do not need to he mandated 
hut simply permitted; parents, teachers, and administrators alike will see 
to it, if the control they once had over their schools is returned to them. 
To have America’s children, poor as well as rieh, once again attending 
safe, orderly schools would be no small achievement and would likely 
foster more learning than the often chaotic public schools do now. 

Gifted youngsters would also benefit by restoring local control. While 
most parents do not want an authentically tougher education for their 
children, some do, and they tend to he concentrared among the parents 
of the brightest. Folicy should make it as easy as possible for them to 
mateh up with classes that satisfy their ambitions. 

To the extent that the government succeeds in this first goal, the oth- 
ers that we have in mind become less important. But as long as the cur¬ 
rent Situation prevails, in which federal money and the conditions 
surrounding it play a major role in shaping public education, we ree- 
ommend two other measures: 

A federal pme scholarship program . This is one instance in which a spe¬ 
cific, federal program could do some good in restoring educational ex- 
cellence. As the law Stands, federal scholarships and loan assistance are 
awarded almost. exclusively on the basis of financial need, leaving the 
administration of Standards to the Colleges that admit and teach the 
students. That program may continue as is, but Congress should add a 
second program, not contingent on financial need but awarded com- 
petirively—for example, a flat one-time award of $20,000 to the 25,000 
students in the country earning the top scores on Standard ized tests of 
academic achievement, over and above whatever scholarship assistance 
the Student was receiving from other sources. How much would such 
“American Scholars” (the Congress might call them) cost? Five hun¬ 
dred million dollars a year—an amount equivalent to a rounding error 
in the national budget hut one that would dramatically transform the 
signal that the federal government sends about the value it places on 
academic excellence . 1621 

Reallocate some portion of existing elementary and secondary school fed - 
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eral aid away from programs for the disadvantaged to prograrm for the gifted. 
The objecrive is to make sure that public school Systems have roughly 
the same capability to provide for students at the high end of the dis- 
tribution as they have for helping students at the low end. A collateral 
part of this reform. should be to rescind any federal regulations or grant 
requirements that might discourage local school Systems from experi- 
menting with or supporting programs for the gifted. At present, there is 
an overwhelming tilt toward enriching the education of children from 
the low end of the cognitive ability distribution. We propose more of a 
balance across the cognitive ability distribution. 

Restoring the Cuncept of the Educated Man 

Why should the federal government shift money from programs for the 
disadvantaged to programs for the gifted, when we know that a large 
Portion of the gifted come from privileged families? Why not just Sup¬ 
port programs for the gifted who happen to come from poor families as 
well • ln p art I, we went to some lengths to describe the dangers of a cog¬ 
nitive elite. And yet here we call for Steps that could easily increase the 
Segregation of the gifted from everyone eise. Won’t programs for the 
gifted further isolate them? 

The answers to such questions have nothing to do with social justice 
but much to do with the welfare of rhe nation, including rhe ultimate 
welfare of rhe disadvantaged. 

The first point echoes a continuing rheme of this Book: To be inrel- 
lectually gifted is indeed a gift. Nobody “deserves” it. The monetary and 
social rewards that accrue to being intellectually gifted are growing all 
the time, for reasons that are easily condemned as being unfair. Never 
mmd, we are saying. These gifted youngsters are important not because 
they are more virtuous or deserving but because our society’s future de- 
pends on them. The one clear and enduring failure of Contemporary 

American education is at rhe high end of the cognitive ability distrib- 
ution. 

Ideal ly we would like to see the most gifted children receive a de- 
manding education and artend school side by side with a wide ränge of 
children, learning firsthand how the rest of the world lives. Bur that Op¬ 
tion is no more available now rhan it was during the attempts to force 
the racial integration of urban schools in the 1960s and 1970s. The na- 
rion’s elementary and secondary schools are highly segregated by so- 
cioeconomic Status, they will tend to become more so in the future, and 


rhe forc.es pushing these trends are so powerful, stemming from the 
deeply rooted causes that we described in Part 1, that they can be re- 
versed only by a level of state coercion that would be a eure far deadlier 
than the disease. 

Most gifted students are going to grow up segregated from the rest of 
society no matter what. They will then go to rhe elite Colleges no mat¬ 
ter whar, move into successful careers no matter what, and eventually 
lead the institutions of this country no matter what. Therefore, the na¬ 
tion had hetter do its damnedest to make them as uuse as it can, If they 
cannot grow up knowing how the rest of the world lives, they can at 
least grow up with a proper humility about their capacity to reinvent 
the world de novo and thoughtfuily aware of their intellectual, cultural, 
and ethical heritage. They should be taught their responsibilities as cit- 
izens of a broader society. 

The educational deficit that worries us is symbolized by the drop in 
verbal skills on the SAT What we call verbal skills encompass, among 
other things, the ability to rhink about difficult problems: to analyze, 
pick apart, disaggregate, synthesize, and ultimate ly to understand. It has 
seldom been more apparent how important it is that the people who 
count in husiness, law, politics, and our universiries know how to think 
ahout their problems in complex, rigorous modes and how important ir 
is that they bring to their thinking depth of judgment and, in the lan- 
guage of Aristoile, the hahit of virtue. This kind of wisdom—for wis- 
dorn is what we need more of—does not come naturally with a high IQ. 
It has to be added through education, and education of a particular kind. 

We are not talking about generalized higher Standards. Rather, we are 
thinking ol the classical idea of the “educated man”—whieh we will 
amend to “educated person”—in which to be educated meant first of all 
to master a core body of material and skills. The idea is not wedded to 
the specific curriculum that made an educated man in the ninereenth- 
century British public school or in the Greek lyceum. But it is wedded 
to the idea of certain high intellectual goals. For example, to be an ed¬ 
ucated person meant being able to write competenrly and argue logi- 
cally. Therefore, children were taught the inner logic of grammar and 
syntax because that kind of attention to detail was helieved to carry over 
to greater precision of thinking. They were expected to learn Aristotle s 
catalog of fallacies, because educators understood rhat the ability to as¬ 
sess an argument in everyday life was honed by mastering rhe formal el- 
ements of logic. Ethics and theology were part of the curriculum, to 
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teach and to refine virtue. We will not try to prescribe how a Contem¬ 
porary curriculum might be revised to achieve the same end.s, beyond a 
few essentials: To be an educared person must mean to have mastered a 
eure of history, literature, arts, ethics, and the Sciences and, in the 
process of leaming those disciplines, to have been trained to weigh, an- 
alyze, and evaluate according to exacting Standards. This process must 
begin in elementary school and must continue through the university. 

Our proposal will sound, and is, elitist, but only in the sense that, al¬ 
ter exposing students to the best the world’s intellectual heritage has to 
offer and challenging them to achieve whatever level of excellence they 
are capable of, just a minority of students has the potential to hecome 
an educated person as we are using the term. 1t is not within every- 
one’s ability to understand the world’s intellectual heritage at the same 
level, any more than everyone who enters College can expect to be a 
theoretical physicist by trying hard enough. At every stage of leaming, 
some people reach their limits. This is not a controversial Statement when 
it applies to the highest levels of leaming. Readers who kept taking 
marhematics as long as they could stand it know that at some poim they 
hit the wall, and studying hard was no longer enough. 

The nation has been unwilling to accept in recent decades that the 
same phenomenon of individual limitation applies at every level of ed- 
ucation. Given the constraints of time and educational resources, some 
students cannot be taught Statistical theory; a smaller fraction of stu¬ 
dents cannot be taught the role of mercantilism in European history; for 
even a smaller fraction, writing a coherent essay may he out of reach. 
Each level ol accomplishment deserves respect on its own merits, but 
the ideal of the educated person is in itself an ideal that must be em- 
hraced openly. Ry abandomng it, America has been falling short hoth 
in educating its most gifted and in incuicating, across the entire cogni- 
tive distribution, the values we would want in an educated citizenry. 

But what do we want to du? What courses should be required of ed¬ 
ucated persons? Do we want to have separate schools for the gifted and 
average Student ? Tracking Systems? A national Great Books curriculum ? 

We will say it again: Different parents will want to make different 
choices for their children. We are not wise enough—and neither are 
any of our oolleagues wise enough, nor is the federal government wise 
enough—to prescribe for them what is best for their children. The goal 
of developing educated persons, like the goal of improving American 


Educated, Not Credentialed 


If we have nur already made it plain, lec us srate explicitly that we are 
proposing a tradirional ideal of education, not glorifying academic cre- 
denrials. To he an educated person as we use the term will ordinarily en- 
tail getting a degree, but that is incidental. Credentialism—unnecessarily 
limiting access to johs to people with certain liccnses and degrees—is part 
of the problem, not a solution. Recause academic credentials are over- 
valued, America shies away Irom accepting that many people have acad¬ 
emic limirations—hence, the dumbing down that holds hack the brightest 
youngsters. 


education in general, will best be served by letting parents and local 
communities make those choices. 

But parents and communities must tum to educators to implement 
their hopes for their children, and here is the problem: Too few educa - 
tors are comfortable with the idea of the educated person. A Century ago 
the notion of an educated person was an expression of a shared under- 
standing, not of legal requirements. That understanding arose heeause 
people were at ease wirb intellectual Standards, with rigor, with a recog- 
nirion that people differ in their capacities. The criterion for being an 
educated person did not have to be compromised to include the suppo- 
sition that everyone could meet it. The concept of the educated person 
has been out of fashion with the people who run elementary and sec- 
ondary schools and, for that matter, with too many of the people who 
run universities. 

Our policy goal? That educators who read these words change their 
minds. It is a re form that is at once impossihle to legkslate but requires 
no money at all. It a reform that would not jeopardize the educational 
advances of the average Student. All that we ask is that educational lead- 
ers rededicate themselves to the duty that was once at the heart of their 
calling, to demand much from those fortunate students to whom much 
has been given. 




Chapter 19 


Affirmative Action in Higher 
Education 


Affirmative action on the campus needs , at last, tu be discussed as it is actu- 
ally practiced, not as the rhetoric portrays it. Our own eff ans to assemble data 
on a secretive process lead us to condude that affinnative action as it is prac- 
ticed cannot survive public scrutiny. 

The edge given to minority applicants to College and graduate school is not 
a nod in their favor in the case of a close call but an extremdy large advan- 
tage that puts black and Latino candidates in a separate admissions competi - 
tion. On dite carnpuses , the average black freshman is in the region of the 
lOth to I5th percentile of the distribution of cognitive ability among white 
freshman. Nationwide , thegap seems to be at least that large , perhaps larger. 
The gap does not diminfsfi in graduate school. If anything , it may be larger. 

In the world of College admissions , Asians are a conspicuously unprotected 
minority. At the dite schools, they suffer a mock’st penalty, with the average 
Asian freshman being at about the 60th percentile of the white cogriidve abiL 
ity distribution. Our data from state unwersiües are too sparse to draw com 
clusions. In all the available cases, the difference between white and Asian 
distributions is small (either plus or minus) compared to the large differences 
separating blacks and Latinos from whites. 

The edge given to minority candidates could be more easily defended if the 
competition were between disadvantaged minority youths and prwileged white 
;youths. But nearly as large a cognitive difference separates disadvantaged 
black freshmen from disadvantaged white freshmen. Still more difficult to de - 
fend } blacks from affinem socioeconomic backgrounds are given a substantiell 
edge over disadvantaged whites. 

There is no question that affirmative action has “workedf ’ in the sense that 
it has put more blacks and Latinos on College carnpuses than would otherwise 
have been there. But this success must he measured against costs. When stu - 
dents look around them, they see that blacks and Latinos constitute small pro - 
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portions of the Student Population but high proportions of the Student s dumg 
poorly in school. The psychological consequences of this disparity may he pan 
of the explanationfor the increasingracial animosity and the high black dropout 
rates that have troubled American campuses. In society at large , a College de - 
gree does not have the same meaningfur a minority graduate and a white one, 
with consequences that reverberate in the workplace arid continue throughout 
life. 

It is time to re tum to the original intentions of affirmative action: tu cast a 
wider net, to give preference to members of disadvantaged groups, whatever 
their skin color, when qualifications are similar. Such a change would accord 
more closely with the logic underlyingaffirmative action, with the needs of to - 
day’s students of all cthnic groups, and with progress toward a healthy muh 
tiracial society. 


W c come to national policies that require people to rrear groups 
ditferently under the law. Affirmative action hegan to he woven 
into American employment and educational practices in the 1960s as 
universities and employers intensified their recruiting of blacks■—ini- 
tially on their own, then in compliance with a vvidening hody of court 
decisions and laws. By the early 1970s, affirmative action had been ex- 
panded beyond blacks to include women, Latinos, and the Jisahled. 1t 
also became more aggressive. Targets, guidelines, and de facto quoias 11 * 
evolved as universities and employers discovered that the equality of 
outcome that people sought was not to be had from traditicsnal recrutr- 
ing methods. As it became more aggressive, affirmative action became 
correspond ingly more controversial. 

Affirmative action creates antagonism partly because it afteers the 
disrrihution of scarce goods—university places, scholarships, joh öfters, 
and promotions—that people prize. Rut it is also prohlematic for rea- 
sons that reaoh into deeply held beliefs—mosr fundamental ly, beliefs 
about the ideal of equal opportunity versus the reality of the histortcal 
experience of certain groups, preeminently hlacks, in this country. As 
the rhetoric heats up, the Arguments ahout affirmative action become 
hlurred. Affirmative action raises different questions in different con- 
texts. Whar, people ask, are the proper goals of affirmative adion, the 
proper methods? Which groups are to he benefited? Whar are the costs 
of affirmative action, and who should bear them? Is affirmative action 


a temporary expedient to correct past wrongs, or must the American 
ideal of individualism be permanently modified for the collective needs 
of members of certain groups? 

Affirmative action is part of this book because it has been based on 
the explicit assumption that ethnic groups do not differ in the abilities 
that contribute to success in school and the workplace—or, at any rate, 
there are no differences that cannot be made up with a few remedial 
courses or a few months on the Job. Much of this book has been given 
over to the many ways in which that assumption is wrong. The impli- 
cations have to he discussed, and that is the purpose of this chapter and 
the next, augmented hy an appendix on the evolution of affirmative ac¬ 
tion regulations (Appendix 7). Together, these materials constitute a 
longer discussion than we devote to any other policy issue, for two rea- 
sons. First, we are making a case that contradicts a received wisdom em- 
bedded in an intellectual consensus, federal legislation, and Supreme 
Court jurisprudence. If the task is to be attempted at all, it must be done 
thoroughly. Second, we believe affirmative action to be one of the most 
far-reaching domestic issues of our time—not measured in its immedi- 
ate effects, but in its deep and pervasive impact on America’s under- 
Standing of what is just and unjust, how a pluralist society should be 
organized, and what America is supposed to stand for. 

ln this chapter, the topic is the College campus. In Chapter 20, we 
discuss affirmative action in the workplace. In both chapters, we pro- 
vide data as available on Asians and Latinos, but the analysis centers 
on blacks, as has the dehate over affirmative action. 

THE “EDGE” IN AFFIRMATIVE ACTION 

People may agree that they want affirmative action in higher education 
until they say more precisely what they mean by it. Then they may dis- 
agree, But whatever the argument, it would help to have some data 
about how Colleges and universities have translated the universal desire 
for greater fairness in university education into affirmative action pro- 
grams. Our first goal is to inform the debate with such data. 

At first glance, ours may seern an odd objective, for certain kinds of 
data about affirmative action are abundant. Universities and businesses 
keep detailed numbers about the numbers of minorities who apply and 
are accepted. But data about the core mechanism of affirmative action 
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the magnitudes of the values assigned to group membership—are not 
part of the public debate. 

This ignorance about practice was revealed in 1991 by a law Student 
at Georgetown University, Timothy Maguire, who had been hired to file 
Student records.“ He surreptitiously compiled the entrance statistics for 
a sample of applicants to Georgetowns law school and then published 
the results of bis research in the law schools Student newspaper. He re- 
vealed that the mean on the Law School Aptitude Test (LSAT) differed 
by a large margin for accepted black and white students. 

In the storm that ensued, the dean of the law school ordered copies 
of the newspaper to be confiscated and black Student groups called for 
Maguire’s expulsion. Hardly anyone would acknowledge that Maguire’s 
numbers even raised a legitimate issue. “Incomplete and distorted in- 
formation about minority qualifications for admission into the Law 
Center renew the long-standing and intellectually dishonest myth that 
they are less qualified than their white counterparts to compete in 
school, perform on the job or receive a promotion,” wrote the authors 
of an op-ed article in the Washington Pos t / and that seemed to be the 
prevailing attitude. The numerical magnitude of the edge given to mem- 
bers of certain groups—the value assigned to the state of being black, 
Latino, fernale, or physically disabled—was not considered relevant. 

Such edges are inherent in the process. In as neutral and precise lan- 
guage as we can devise: Perfectly practiced, the traditional American 
ideal ofequal opportunity means usingexclusively individual measures, 
applied uniformly, to choose some people over others. Perfectly prac- 
ticed, affirmative action means assigning a premium, an edge, to group 
membership in addition to the individual measures before making a fi¬ 
nal assessment that chooses some people over others. 

The size of the premium assigned to group membership—an ethnic 
premium when it is applied to affirmative action for favored ethnic 
groups—is important in trying to judge whether affirmative action in 
principle is working. This knowledge should be useful not only (or even 
primarily) for deciding whether one is “for” or “against” affirmative ac¬ 
tion in the abstract. 1t should be especially useful for the proponents of 
affirmative action. Given that one is in favor of affirmative action, how 
may it be practiced in a way that conforms with one s overall notions of 
what is fair and appropriate? If one opposes affirmative action in prin¬ 
ciple, how much is it deforming behavior in practice? 

It is not obvious precisely where questions of fact trail into questions 


of philosophy, but we will attempt to stay on the factual side of the line 
at first. A bir of philosophical speculation is reserved for the end of the 
chapter. We first examine evidence on the magnitude of the ethnic pre¬ 
mium from individual Colleges and universities, then from Professional 
schools. We then recast the NLSY data in terms of the rationale un- 
derlying affirmative action. We conclude that the size of the premium 
is unreasonably large, producing differences in academic talent across 
campus ethnic groups so gaping that they are in no one’s best interest. 
We further argue that the current practice is out of keeping with the ra¬ 
tionale for affirmative action. 

The Magnitude of the Edge in Undergraduaie Schools 

We have obtained SAT data on classes entering twenty-six of the na- 
tion’s top Colleges and universities. ln 1975, most of the nation’s elite 
private Colleges and universities formed the Consortium on Financing 
Higher Education (COFHE), which, among other things, compiles and 
shares Information on the students at mernber institutions, including 
their SAT scores. We have obtained rhese data for the classes entering 
in 1991 and 1992. 4 They include sixteen out of the twenty top-rated 
private universities and five of the top ten private Colleges, as ranked in 
US. News and World Report for 199V The figure below shows the dif- 
ference in the sum of the average Verbal and Math SAT scores hetween 
whites and two minorities, blacks and Asians, for the classes in the 
COFHE schools that matriculated in the fall of 1992. ln addition, the 
figure includes data on the University of Virginia and the University of 
California at Berkeley in 19880 

The difference between black and white scores was less than 100 
points at only one school, Harvard. It exceeded 200 poinrs at nine 
schools, reaching its highest at Berkeley (288 points). Overall, the me¬ 
dian difference between the white mean and the black mean was 180 
SAT points, or, conservatively estimated, about 1.5 Standard devia- 
tions. 1 ' 1 This would put the average black at about the lOth percentile 
of white students. In all but tour schools, Asians were within 6 points 
of the white mean or above it, with a median SAT 50 points above the 
local white average, working out to about .2 Standard deviations. Or in 
other words, the average Asian was at about the 60th percentile of the 
white distribution. This combination means that blacks and Asians 
have even less overlap than blacks and whites at most schools, with the 
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At selective schools, the median black edge was 180 SAT points, 
while Asians faced a median penalty of 30 points 


-300 


SAT-point difference from the white mean 
Blacks Asians 


70 



Rice 

Berkeley 
Univ. of Virginia 
Dartmouth 
Oberl in 

Univ. of Rochester 

Wesleyan 

Univ. of Chicago 

Stanford 

Columbia 

Duke 

Trinity 

Williams 

Northwestern 

Johns Hopkins 

Wellesley 

Swarthmore 

Amherst 

Prineeion 

Brown 

Cornell 

U Penn 

Harvard 

Georgetown 

MIT 

Washington 
+ 100 


Snums: Omsorrium on Finnncing Higher Education 1992; Sarich 1990 (for Berkeley), 1.. 
Ivinherg, “Black (reshman cmollmmt riscs 46% ar U-Va,” Washington Post, l\-c. 26, 19ö«' 
p. CI (lor University of Virginia). 


median black at the 5th to 7th percentile of the distrihurion of Asian 
students. Data for Latinos (not shown in the figure) put them between 
blacks and whites, with a median of 129 points below the white mean, 
or about .9 Standard deviation below the white mean in the typical case. 
The average Latino is therefore at about the 20th percentile of the dis- 
tribution of white students. 181 


The ordering of black, Latino, white, and Asian is similar to that re- 
ported tot IQ and SAT scores in Chapter 1 3. ln other words, elite uni- 
versities are race norming (though it is doubtful they think of it that 
way), carrying with them into their Student populations the ethnic dif- 
ferences in cognitive distributions observed in the population at large. 

We would preier to have a sample of nonelite state universities rep- 
resented in our data, but such numbers are c lose ly guardedJ gi The only 
data we have obtained come from the University of California at Davis, 
tot 1979. The black-white difference then was 271 SAT points, and the 
Larino-white difference 211 points. 1101 The Asian mean at Davis was, 
atypieally, 54 points he low the white mean, the 1 arges t such difference 
we have found. 

The data from the University of Virginia and the two University of 
California campuses suggest that the gap between minoriries and whites 
among besinnen at state universities may be larger than at the elite pri¬ 
vate schools. It is only a Suggestion, given the limited data, but it also 


Are Asians the Victims of Reverse Discrimination? 


Complaints that Asian-Ameriean applicants were being subjected to re- 
\'erse discrimination led eventually to a fulbscale inquiry in the late 1980s 
by the federal Office for Civil Rights. Harvard, which was examined 
clo.se ly, was ahle to show that the SAT penalty of their Asian admitted stu¬ 
dents w'as accounted for hy the smaller n um her of alumni children and arh- 
letes in the pool, and eventually gor a clean bill of healrh, but the 
controversy teinains at many other institurions. 11 Brown responded to a re- 
port trom its Asian-Ameriean Students Association by admitting the ex- 
istence of “an exrremely serious Situation“ and called for “immediate 
remedial mensures.” 12 At Berkeley, Stanford, Princeton, and other elite 
schools, special committees have investigated the issue, issuing reports that 
rend to exonerate their Colleges of actual reverse discrimination hut ac- 
knowledge shortcomings in keeping up with rhe revolution in Asian ap¬ 
plicants. 1 ' 

The underlying source of tension remains: Asians are an ethnic minor- 
iry, many of whom, or whose parents, came to ehe United States ander cir- 
cumstanees of extreme deprivation. Many suffered trom racial prejudice. 
Whether or not they are treated differcnrly from whites hy elite universi- 
ties, Asians are indisputably treated differently from every other nonwhite 
ethnic minoriry. University officials everywhere have been reluctant to 
confront this issue forthrightly. 
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makes sense: Places like Harvard, Stanford, Yale, and MIT get first pick. 
Because the raw numbers of high-scoring black and Latino students arc 
so small, the top schooLs dig deep into the thin layer of minority stu¬ 
dents at the top of the SAT distrihution. ln 199.3, for example, only 129 
blacks and 234 Latinos nationwide had SAT-Verbal scores in the 700s— 
and these represented all-time highs—compared to 7,114 whites. Even 
highly rated state institutions such as the University of Californias 
Berkeley campus and the University of Virginia lose many of these most 
talented minority students to the elite private schools while continuing 
to get many of the top scorers in the larger white pool. Such are the 
mathematics of competition for a scarce good, borne out by the limited 
university data available, which show the three state universities with 
three of the four largest black-white gaps in SATs. 


The Law of Supply and Demand in Minority Recruiting 


Affirmative aetion has produced intense competition for the top black and 
Latino students. In the spring of 1992, Harvard reported rhat its “yield” of 
black students abruptly declined from the year before. The Harvard report 
suggested that the decline was due at least in part to the large financial in- 
centives being offered to blacks by other Colleges. One such black Student, 
it was reported, received a straight grant of $85,000, plus $10,000 in am 
nual travel budgets, from one of Harvards competitors in minority re- 
cruiting. 14 An article in the New York Times provided more instances of a 
practice that increasingly includes the kind of enticements—full scholar- 
ships even for families with ample financial resources, free trips to visit the 
campus, recruiting visits, and promotional activities—that used to be re- 
served for star high school athletes. “As a result, a number of College ofti- 
cials private ly accuse each other of \srealing’ black students,” the Times 
re porter notedV 


The differences do not: seem to have changed a great deal berween 
the 1970s and rhe 1990s. The best longitudinal data from Berkeley il- 
luscrate a perverse effect of a strong affirmative aetion policy: The more 
aggressive the reeruitment of minorities, the higher rhe average ahility 
of the nonminority students. From 1978 to 1988, rhe combined SATs of 
blacks at Berkeley rose by 101 points, a major improvement in the aca- 
demic quality of black studenrs at Berkeley. Bur the competition for the 
allotment of white slots became ever more intense. The result was that 


the SAT scores for Berkeley whites rose tot), and the gap between black 
and white students at Berkeley did not dose but widened.* 161 Meanwhile, 
rhe unprotected minority, Asians, also were competing for a restricted 
allotment of slots. Their mean scores rose more than any other group s, 
and by a large margin, going from far below the white mean to slightly 
above it. In just eleven years, the Asian mean at Berkeley soared by 189 
points. 

The summary Statement ahout affirmative aetion in undergraduate 
institutions is that being either a black or a Latino is worth a great deal 
in the admissions process at every undergraduate school for which we 
have data. Even the smallest known black-white difference (95 points 
at Harvard) represents close to a Standard deviation for Harvard uiv 
dergraduates. The gap in most Colleges is so large thar the black and 
whire Student bodies have little overlap. The Situation is less extreme 
for Latino students but still severe. Asian students appear to suffer a 
penalty for being Asian, albeit a small one on the average. We have seen 
no data that woukl dispute this picture. lf such data exist, perhaps this 
presenrarion will encourage their publication. 

The Magnitude of the. Edge in Graduate Schools 

Law S( :HOOI,s. Timothy Maguire’s findings ahout the Georgetown Law 
Center were consistent with more systematic evidence. The tahle be¬ 
low shows the national Law School Aptitude Test (LSAT) results for 
1992 for registered first-year law students. For blacks, overlap wich the 
white incoming law students was small; only 7 percent had scores above 
the white mean. The overall Latino-white difference was 1 Standard de¬ 
viation. It was markedly larger for Puerto Ricans (-2.0 SDs) than for 


Affirmative Action Weights: 

The Law School Aptitude Test 

Ethnie Group 

Difference from 
White Mean, 
in SDs 

Asian/Pacific 

-.32 

Blacks 

-1.49 

Latinos 

-1.01 

Soura*: Barnes and Carr 

199V 
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Mexican-Americans (-.8) or “other” Latinos (-.7). The overall Asian 
mean corresponds to the 38th percentile on the white distribution, ev- 
idence of modest affirmative action on behalf of Asian applicants in the 
law schools. 

The table above is for the national population of first-year law 
students. To assess the effects of affirmative action, it would be prefer- 
able to have data from individual law schools. At upper reaches of the 
LSAT distribution, from which the elite law schools drew most of their 
students, there was even less overlap between whites and blacks than in 
the SAT pool. More than 1,100 registered white law students had scores 
of 170 or higher on a scale going from 120 to 180, compared to three 
blacks. At ten highly selective law schools for which individual data 
were reported in a 1977 report by the Law School Admissions Council, 
the smallest black-white difference in LSAT scores (expressed in terms 
of the white distribution) at any of the ten schools was 2.4 Standard de- 
viations, the largest was 3.6 Standard deviations, and the average differ¬ 
ence for the ten schools was 2.9 Standard deviations, meaning that the 
average black was in the hotrom 1 percent of the white distribution. 1171 

Medical Sch(X)LS. Medical students repeat the familiär pattem, as 
shown for the national population of matriculated first-year students in 
1992 in the table below. In the national pool, the black-white gap is 


Affirmative Action Weights: 

The Medical College Admissions Test 


Ethnie 

Group 

Blacks 


Difference from the White Mean, in SDs 
Biological Physical Verbal 

Sciences Sciences Reasoning 

-1.36 -1.26 -1.40 


“Other under-represented 


minorities”' 1 -.75 -.84 -.84 

“Üther” b +.04 +.15 -45 


Source; Division of Educational Research and Assessment 1993, pp. 59-63. 

■' “Other under-represented minorities” consists of American Indian/Alnskan 
natives, MexicamAmerican/Chicanos, and mainland Puerto Rica ns. 

b Asian/Pacific, Commonwealth Puerro Ricans, and Latinos not otherwise 
classified. 


ahout the same as in the law schools, with the average entering black 
medical Student at the 8th to lOth percentile of the white distribution, 
depending on which subtest of the Medical College Admissions Test 
(MCAT) we consider. The gap between whites and “other underrepre- 
sented minorities” is a hit smaller than the Latino-white gap in law 
school, with the average Student in this group Standing at the 20th to 
23d percentile of the white distribution. The “other” caregory—mostly 
Asian—had higher scores than whites on the physical Sciences and 
(fractionally) on hiological Sciences, Standing, respectively, at the 56th 
and 52d percentiles of the white distribution, while scoring lower in ver¬ 
bal reasoning (32d percentile). 

As in the case of law schools, the black medical Student pool is even 
more severe ly depleted at the top end of the ränge than it is in under- 
graduate schools, with important impli carions for the gap in the elite 
schools. ln none of the three suhtests did more than 19 blacks score in 
the 12 to 15 ränge (on a scale that goes from 1 to 15), compared to 1,146, 

1,469, and 853 whites (for the hiological Sciences, physical Sciences, and 
verbal reasoning tests, respectively). 1181 In practical terms, several of the 
elite schools can fill their entire dass with white students in the top 
ränge, hut only the one or two most elite schools can hope to have a sig- 
nificant number of black students withour producing extremely large 
black-white differences, comparable to those reported for elite law 
schools. 

Other studies have published data on medical school admissions, ex¬ 
pressed in terms of the odds of being accepted to medical school for dif¬ 
ferent minorities. All teil similar srories to ours. lwl 

Graduate Schools in tue Arts and Sciences. Applicants to gradu- 
ate schools other than law and medicine typically take the Graduate 
Record Examination (GRE), comprising verbal, quantitative, and ana- 
lytical suhtests. The reports of GRE scores do not distinguish between 
persons who take the tost and persons who actually reg ist er in a gradu- 
ate school, so they are less useful than the LSAT or MCAT in trying to 
Linderstand the scope and magnitude of affirmative action in those 
schools. Nonetheless, the results, in the table helow, lock familiär. The 
magnitudes of the ethnic differences on the individual suhtests of the 
GRE (in 1987-1988, the most recent year for which we were given data) 
were somewhat smaller than for the Professional schools, putting blacks 
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at the lOth to 12th percentile of the white distribution, depending on 
the subtest. Asians were (as usual) higher than whit.es on the quantita- 
tive and lower on the verbal. Adding up all three subtest means, Asians 
were a few points higher than whites. 


Applicants to Graduate Schools 

Difference from the White Mean, in SDs 


Ethnic Group 

Verbal 

Quantitative 

Analytical 

As ian/Pacific 

-.37 

+.52 

-.15 

Blacks 

-1.20 

-1.19 

-1.29 

Latino 

-.74 

-.46 

-.54 


Sowrct': Wah and Robinson, 1990, Table 2.2. 


1 he summary Statement is thar the ethnic gaps in objective test seores 
observed in Lindergraduate institutions are matched, and perhaps ex-* 
ceeded, in graduate and Professional schools. If data become available 
from individual schools, this question can he answered definitively. 

AFFIRMATIVE ACTION AS PART OF THE ADMISSIONS 
PROCESS 

The data we have just summarized should restrain casual asserrions that 
the differences among the blacks, Latinos, Asians, and whites who go 
to College are not worth worrying about. The differences we have de- 
scribed are large by any definition. But do rhese data give us any lever- 
age on the question of whether affirmative action as it is currenrly 
practiced is good or bad? For an ans wer, we begin by inquiring into the 
logic of affirmative action and then examine whether the pattems of 
racial and socioeconomic differences observed in the NLSY make sense 
in terms of that logic. 


The Logic of College Admissions 

On the campus, affirmative action is not at odds with the normal ad- 
missions process. College admission is not, has never been, nor is rhere 
reason to think it should be, a competition based purely on academic 
merit. The nonacademic ends can be legitimate and important. No ad¬ 
missions policy can serve all good ends equally, because the ends are of- 
ten inconsistent with one another. The admissions process is a juggltng 
act, and affirmative action fits squarely in a long tradition. Our under- 
Standing of the legitimate role of affirmative action, which owes much 
to Robert Klitgaard’s discussion of the saine topic, will be categorized 
under the headings of “institutional benefit," “social utility," and “just 
desertsA 0 

INSTITUTIONAL Blnekit. One of the goals of any admissions process is 
to serve the institution’s own interests. Why do many Colleges give some 
preference to students from faraway States? To children of alumni ? 1211 To 
alhstate linebackers or coneert pianists? Seme of the answers involve 
the good of the Institution as a whole. A Student from Montana can add 
Jiversity to a College in Connecticut; a good football team can 
strengthen a Colleges sense of community and perhaps encourage 
alumni generosity. Black and Latino students admitted under affirma¬ 
tive action can enrich a campus by adding to its diversity. 

The institution also has interests beyond daily campus life. Admit- 
ting the children of its faculty and of its mast generous alumni may add 
little that is distinctive to the Student hody, for example, but their par- 
ents make a big difference to the health and quality of the institution, 
and keeping them happy is important. Beyond the College gates is soci- 
ety at large. Universities cannot disregard what the broader community 
rhinks of them, and so they must be sensitive to the currents of their 
time. The political pressure (let alone the legal requirement) for some 
level of affirmative action in the universities has been irresistible. 

These institutional interests are valid and significant but unsatisfac- 
tory as the entire rationale for affirmative action, for there are too many 
ways in which affirmative action has self-evident drawhacks. If it is ad- 
missible to augment the presence of some racial or ethnic minorities 
solely because they serve the interests of the university, is it not also ap- 
propriate to limit the presence of minorities for the same reason? It is a 
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relevant question, for, while limits for Jews may be largely behind us, 
limits for Asians may be upon us. Furthermore, one cannot avoid the 
problem by arguing that it is appropriate to have floors for certain groups 
but inappropriate to have ceilings for others. Making more room for one 
group must reduce the room for others. Instinetively, one wishes for 
morally stronger justifications for affirmative action than institutional 
interests. Two are available. 

SOCIAL Utility* Consider the case of the crown prince of a large king- 
dom who also happens to be a young man of pedestrian intelligence and 
indifferent character. He applies to a competitive American university— 
Princeton, we shall say. Should Princeton admit him in preference to the 
many brighter and more virtuous students whose applications flood 
the admissions office? The social utility criterion may say yes, for this 
young man is eventually going to influence the lives of the millions of 
people in his own country. He may be drawn into issues that could affect 
international peace and prosperity. Princeton makes a contribution to 
human happ iness if it can help the crown prince develop into a thought- 
ful and humane adult. 

The same kind of calculation bedevils Professional schools in choos- 
ing among men and women. For example, if it is empirically true that 
women are more likely than men to leave a profession, there is an au- 
thentic question of resources to be considered when selecting who shall 
be trained in that profession. Given that the good called a medical ed¬ 
ucation is severely limited, how important is the ethical nudge in the 
direction of using scarce resources efficiently? Conversely, how impor¬ 
tant is it to get women into these professions so that, in the future, it 
will be easier for more of them to pursue such careers? 

Suppose now that it is again Princeton choosing between two can- 
didates, one black and one white. Both are from affluent Professional 
families, so socioeconomic disadvantage is not an issue. The white has 
higher test scores and (just to make the case still plainer) more glowing 
references than the black candidate. Both plan to become attorneys. In 
some sense, the white candidate “deserves” admission more. But who is 
going to provide more social “value-added”? Adding one more white at- 
torney to the ranks of prominent attorneys, or adding one more black 
one? Princeton could reasonably choose the black candidate on grounds 
that only by expanding the size of the next generation of minority 
lawyers, physicians, businessmen, and professors can society attain racial 
equality at the higher socioeconomic and Professional levels. Only 


when equality is reached at those higher levels will minority youths rou- 
tinely aspire to such careers. And, the argument continues, only when 
the aspirations for success and their fulfillment are thus equalized will 
we reach the kind of real racial equality that will eventually show up in 
test scores as well as everything eise. 

For now, let us ignore whether affirmative action will in fact have 
these good effects and concentrate instead on the logic of the argument. 
The same logic can justify not only choosing a member of a minority over 
a white, it can justify choosing a member of one minority over another. 
For example, a case may be made for systematically favoring blacks over 
Asians on the social utility criterion—based not on calculations that 
African slaves faced greater oppression in the past than the Chinese 
brought to build the railroads but on the proposition that the opportu- 
nities for a degree may be more valuably distributed to African Ameri- 
cans instead of Asian Americans, given the Contemporary state of affairs 
in American society. Indeed, early in this Century, when Colleges were 
discriminating against Jews, the reasons given, when they were given at 
all, were a mixture of institutional self-interest and social utility. [22] 

Once again, however, the rationale for affirmative action is not fully 
satisfactory. Looking back to the time when the numbers of Jews or 
women on a campus were strictly limited, most people feel uncomfort- 
able with the rationales, however dispas'sionately accurate they might 
have seemed at the time. They are uncomfortable partly because of the 
injustice, which brings us to the final criterion that should be part of 
the admissions process. 

JUST Deserts. Beyond institutional benefit and social utility, College 
admissions may recognize what might be called “just deserts.” As the di- 
rector of admissions to Columbia College expressed it, “One has to take 
into account how well one has done with the environment [an appli- 
cant has] been handed.” 23 The applicant who overcame poverty, cul- 
tural disadvantages, an unsettled home life, a prolonged illness, or a 
chronic disability to do as well as he did in high school will get a tip 
from most admissions committees, even if he is not doing as well acad- 
emically as the applicants usually accepted. This tip for the disadvan- 
taged does not seem unfair. 

This is the intuitive rationale of affirmative action for blacks, who 
were demonstrably the victims of legal oppression, enforced by the state, 
from the founding of the colonies through the middle of this Century, 
and of pervasive social discrimination that still persists to some degree. 
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To give blacks an edge because they are black accords with this sense of 
justice. At an elaborated level, there is a widespread impression that the 
underrepresentation of blacks and Latinos (and perhaps other groups, 
such as American Indians) in elite schools is an effect of racial or eth- 
nie injustice, properly corrected by affirmative action in university ad¬ 
missions. If it were not for the racism in our society, the groups would 
be proportionally represented, some believe. A still more elaborated 
Version of the argument is that the very approach to learning, reason- 
ing, and argumentation in universities is itself racist, so that the pre- 
dictors of university performance, such as SAT or IQ scores, are 
therefore racist too. Affirmative action redresses the built-in racism in 
the admissions process and the curriculum. 24 


Two Common But Invalid Arguments Regarding 
Affirmative Action 


We have reviewed the rationales for affirmative action without even men- 
tioning the two most commonly made points: first, that the real difference 
in academic ability between minority and white candidates is much smaller 
than the difference as measured by test scores, and, second, that gradations 
in ability do not count for much after a certain threshold of ability has been 
met. 

This first point is based on allegations of cultural bias in the tests, cov¬ 
ernd in Chapter 13 and Appendix 5. As readers will by now be aware, much 
research argues strongly against it* The second point, often expressed by 
university officials with the words “everyone we admit can do the work,” 
is true in the limited sense that students with comparatively low levels of 
ability can get passing grades. It is not correct in any broader sense. Higher 
scores predict better academic performance throughout the ränge of scores. 
There is no reason to think that a threshold exists above which differences 
in tested ability have little effect on the quality of the Student body, Stu¬ 
dent performance, and the nature of Student interactions. 25 


So there are three coherent rationales for concluding that it is just, 
as well as institutionally and socially useful, to admit minority students 
from specific minority groups even if they are somewhat less qualified 
than the other candidates who would be admitted. The rationales are 
not even controversial. Few of the opponents of affirmative action are 
prepared to argue that universities should ignore any of these criteria al- 
together in making admissions decisions. With that issue behind us, the 


question becomes whether affirmative action as it is being practiced is 
doing what its advocates want it to do. Does it serve worthwhile pur- 
poses for the institutions themselves, for students, for society at large, 
or for a commonly shared sense of justice? 

A Scheme for Comparing Rationales with Practice 

We will set the problem first with hypothetical applicants to College, di- 
vided into four categories, then we will insert the actual cognitive abil¬ 
ity scores of the College students in those categories. The four categories 
are represented in the 2X2 table below, where “low” and “high” refer 
to the full ränge of cultural and economic advantages and disadvantages. 


A Framework for Thinking about the Magnitude of Pref¬ 
erence That Should Be Given to a Minority Candidate 


WHITE 


Low 

High 


(3) 

(4) 


High 

Scarsdale 

Scarsdale 



Appalachia 

Scarsdale 


MINORITY 





(2) 

(i) 


Low 

South Bronx 

South Bronx 



Appalachia 

Scarsdale 



“Scarsdale” denotes any applicant from an upscale family. “South 
Bronx” denotes a disadvantaged minority youth, and “Appalachia” de¬ 
notes a disadvantaged white youth. Each cell in the table corresponds 
to a pair of applicants—a white and a minority—from either high or 
low socioeconomic and cultural circumstances. Starting at the lower 
right and going clockwise around the table, the categories are: (1) a mi¬ 
nority applicant from a disadvantaged background and a white from a 
privileged background; (2) a minority and a white applicant, both from 
disadvantaged backgrounds; (3) a minority applicant from a privileged 
background and a white from a disadvantaged background, and (4) a 
minority and a white applicant, both from privileged backgrounds. 

Imagine you are on the admissions committee and choosing between 
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two oindidates. Assume that all the nonacademic qualifications besides 
race are fully specified by high and low Status for this pair of candidatcs 
and that the IQ is the only measure of academic ability being consid- 
ered. (In other words, let us disregard grades, extracurricular activities, 
athletics, alumni parents, and other factors.) You are trying to decide 
whether to admit the minority applicant or the white applicant. How 
big a difference in IQ are you willing to accept in each cell and still pick 
the minority candidate over the white candidate? Let us consider each 
cell in turn, starting with the Situation in which the minority might be 
expected to get the largest premium to the one in which the premium 
arguably should go to the white. 

Cf.ll 1: Thf. South Bronx Minority versus the Soarsdale White. 
The largest weighr obviously belongs in the cell in which the minority 
Student is disadvantaged and the white Student is advantaged. Consid- 
erations of just deserts argue that it is not fair to equate the fest scores 
of the youngster who has gotten the finest education money and Status 
can buy with the test scores of the youngster who has struggled through 
poor schools and a terrible neighborhood. Considerations of social utih 
ity üigue that it is desirable to have more minority students getting good 
College educations, so that society may alter the effects of past discrim- 
inution and provide a hasis for an eventually color-blind society in the 
future. We assign ++ to this cell to indicate a large preference for the 
minority candidate. A relatively large deficit in the minority applicant's 
test score may properly he overlooked. 

Cell 4: The Soarsdale Minority versus the Scarsdalh White. If a 
College is choosing between two students in the higlvhigh cell, both 
from Scarsdale with college-educared parents and fatnily incomes in six 
figures, the social utility criteria say that there is a rationale for picking 
the minority youth even if his test scores are somewhat lower. But do- 
ing so would violate just deserts when the white Student has higher test 
scores and is in every other way equal to the minority Student. Which 
ciiterion should win out? There is no way to say for sure. Our own view 
is that, as personally hurtful as this injustice may be to the individual 
white person involved, it is relatively minor in the grand scheine of 
things. The privileged white youth, with strong credentials and parents 
who can pay for College, will get tnto a good College someplace. We 
rherefore assign a + to this cell to signify some ethnic premium to the 
minority candidate but less than in che first instance. 


Cell 2: The South Bronx Minority versus the Appalachian 
White. Now imagine a minority Student from the South Bronx and a 
white Student from an impoverished Appalachian community. The fam- 
dies of both students are at the wrong end of the scale of advantage. 
Which one should get the nod in a dose call? The white has just as 
much or nearly as much “social utility” going for him as the black does. 
American society will benefit from educating youngsters from disad¬ 
vantaged white Backgrounds, too. Both have a claim based on just 
deserts. America likes to think that people can work their way up from 
the bottom, and Appalachia is the bottom no less than the South Bronx. 
Perhaps there is some residual premium associated with being black, 
based on the supposition that just being black puts one at a greater dis- 
advantage than a white in the “all eise equal” case—a more persuasive 
point when applied to Blacks from the South Bronx than when applied 
to blacks from Scarsdale. We assign ~0 to this cell, indicating that the 
appropriate ethnic premium for the minority Student is not much greater 
than zero (other things being equal) and is certainly smaller than in the 
Scarsdale-Scarsdale case. 

Cell 3: The Scarsdale Minority versus the Appalachian White. 
Now we are comparing the privileged minority Student with the disad¬ 
vantaged white Student. Where one comes out on the scale of social 
utility depends on how one values the competing goals to be served. It 
seems hard to justify a social utility value that nets out in favor of the 
minority youth, however. (Yes, there is social utility in adding a minor¬ 
ity to the ranks of successful attorneys, even if he comes from an afflu- 
ent Background, but there is also social utility in vindicating the 
American dream for poor whites and in adding a representative of dis- 
advantaged white America to the ranks of successful attorneys.) Some- 
thing close to zero seems to be the appropriate expected value on the 
social utility measure, and the white youth should get a plus on the just 
deserts argument. If the choice is between a poor white youngster from 
an awful environment and an affluent minority youngster who has gone 
to fine schools, and if the poor white has somewhat lower test scores 
than the affluent minority, it is appropriate to give the poor white at 
least a modest premium. We thus enter - into this cell, to reflect the 
fact the white youth gets the nod in a close call. 

The filled-in table is shown below. We may argue about how large an 
ethnic premium, expressed in IQ, should be tolerated in each cell, but 







466 Living Together 


Affirmative Action in Higher Education 467 


A Rationale for Thinking About the Preference 
Given to a Minority Candidate 
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the ranking of the premiums seems hard to dispute. With this in mind, 
we are ready to examine how affirmative action in the NLSY sample 
squared with this view of the appropriate discrepancies. [26] 

Rationale vs. Practice 

To fill in the table with data, we divided NLSY students who went to 
four-year institutions into those in the upper and lower halves of 
socioeconomic background, using the socioeeonomic Status index 
described in Appendix 2. (We also conducted the analysis with more 
extreme definitions of privilege and disadvantage.) [27] We then seleeted 
the subsample of whites and blacks who had attended the same schools, 
and computed the mean IQ for the upper and lower halves of socioe¬ 
conomic Status for these matched pairs, statistically Controlling for In¬ 
stitution. Sample sizes of these matched pairs ranged from 72 for the cell 
in the top left to 504 for the cell in the lower right. The filled-in table 
below shows the difference hetween the white and black IQ scores in 
Standard deviations. [28] 

Let us try to put these numbers in terms of the choices facing an ad¬ 
missions officer. He has two folders on the desk, representing the lower 
left-hand cell of the table. The two applicants differ in cognitive ability 
by 1.17 Standard deviations, and both are socioeconomically dis- 
advantaged. More specifically (incorporating information about the 
means not shown in the table), one Student is almost exactly average in 
cognitive ability for such College students, at the 49th percentile of the 
distribution; the other is at the 12th percentile. Is it appropriate to treat 
the choice as a toss-up if the Student at the 12th percentile happens to 
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be black? [291 The typical admissions officer has, in effect, been treating 
two such applicants as a toss-up. 

We put the question in that way to try to encourage thinking about 
a subject that is not much thought about. How big an edge is appropri¬ 
ate? In a proper ly run System of affirmative action, should the average 
disadvantaged black and average disadvantaged white who got to a 
given College differ by so large a margin? 

Consider the next pair of folders, with two applicants from privileged 
backgrounds (the upper right-hand cell). One is at the 57th centile of 
College students, the other at the 23d centile, corresponding to almost 
a Standard deviation difference. Is it reasonable to choose each with 
equal likelihood if the one at the 23d centile is black, as the typical 
admissions officer now does? 

How might one justify the upper left cell, representing the privileged 
black versus the disadvantaged white, where the edge given to the black 
candidate should be no greater than zero under any plausible rationale 
for affirmative action (or so we argue), and probably should be less than 
zero? A disadvantaged white youth with cognitive ability at the 36th 
centile of College youths now has the same chance of being admitted as 
a privileged black youth at the 17th centile. 

Finally, consider the lower right cell, the one that mos-t closely fits 
the Image of affirmative action, in which a privileged white is compet- 
ing with a disadvantaged black. The logic of affirmative action implies 
a substantial difference in the qualifications of two youths fitting this 
description who have an equal chance of being admitted. Is the differ¬ 
ence actually observed—between a white at the 57th percentile of Col¬ 
lege students and one at the 12th percentile—a reasonable one? In IQ 
terms, this is a difference of almost nineteen points. 
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We do not suppose that admissions officers have these folders side by 
side as they make their decisions. In facr, given the pressures on admis- 
sionscommittees, thedeterminingfactor for admission is otten the sheer 
numbers of minority applicants. If the percentage of minorities in the 
incoming freshman dass goes up, that is considered good. If the per- 
centage goes down, that is considered bad. To make the numbers come 
out right, the admissions committee feels pressed to dig deeper into the 
pool ot available applicants if necessary. They do not want to ad mit mv 
qualified minority candidates, nor do they want to prefer advantaged 
minority applicants over disadvantaged whites. Bur these questions 
arise, if they arise at all, only after the more pressing matter of minority 
representarion is attended to. The goal is to have “enough” blacks and 
other minorities in the incoming dass. Meanwhile, white applicants are 
judged in cornpetition with other white candidates, using the many erb 
teria that have always been applied. 

The main purpose of the exercise we have just conducted is tu suggest 
that admissions committees should be permitted to behave a little more 
like nur imaginary one than they are at present, given the pressures from 
higher levels in the university. If university officials think that these 
data are not adequate for the purposes we have used rhetn, or if they 
think that we have misrepresented the affirmative aetion process, there 
is an easy remedy. Universities across the country have in their 
admissions files all the data needed for definitive analyses oi the rela- 
t ionship of ethnicity, socioeconomic disadvantage, and academic 
ahility—test data, grade data, parental Background data in profusion— 
for students who were accepted and students who were rejected, students 
w 7 ho enrolled and students who did not. At many sehools, the data are 
already in Computer files, ready for analysis. They may readily be made 
available to scholars without compromising confidentiality. Our 
proposition is that affirmative aetion as it is currently practiced in 
Americas universities has lost touch with any reasonable understand- 
ing of the logic and purposes of affirmative aetion. It is easy to put this 
proposition to the test. 

THE SIJCCESS OF AFFIRMATIVE ACTION IN THE 
UNIVERSITIES 

The success of affirmative aetion in the university is indisputable, in the 
sense that a consciously designed public policy, backed by the enthusb 


astic cooperation of universities, drastically increased the number of 
minority students who artend and graduate from College. The mag' 
nitude of the success during the first flush of affirmative aetion is 

apparent in the figure below, which shows the result for black enrolb 

no] 

ments. 


When aggressive affirmative aetion began, black College enrollment 

surged for a decade 



0 % -1-1-“i-r- 

1950 1960 1970 1980 1990 

Source: U.S. Bureau of rhe Census L975, 199g various editions. 


ln 1967, black enrollment of 20-24-year-olds suddenly shot up, and 
continued to rise steeply through the mid-1970s. White enrollment 
experienced no comparable surge during that period. The most plausi¬ 
ble cause of the surge is the aggressive affirmative aetion that hegan in 
the mid-1960s. On the other hand, this figure previews a problem we 
will discuss at more length in the next chapter: Whatever initial impe- 
tus was provided by affirmative aetion, it soon lost rnomentum. Black 
enrollment in the early 1990s was higher than the trendline from 1950 
to 1966 would have predicted, but some sort of evemng-out process 
seems to have set in as well. Black enrollment dropped during the late 
1970s, recovered modestly during the early and mid-1980s, then in- 
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creased sharply at the end of the decade. The level of black College 
enrollment as of the early 1990s is higher than at any other time in 
history. 

Furthermore, the enrollment of blacks rose not only to equality but 
to more than equality with whites of comparable socioeconomic back- 
ground and intelligence. As we showed in Chapter 14, the proportion 
of blacks obtaining College degrees substantially exceeds that of whites, 
after Controlling for IQ. As we have just finished documenting at length, 
the opportunity for College is also more open to blacks than to whites 
with equivalent test scores. 

Given the goals of affirmative action, it is appropriate to see this in- 
crease as a success. We assume as well (we have found no hard data) that 
affirmative action has also increased the sense among minority youths 
that College is an Option for them and increased the number of college- 
educated minority role models for minority youths. Still other benefits 
claimed for affirmative action—helping jump-start advances in the next 
generation of minority groups or improving race relations—are yet in 
the realm of speculation. 

THE COSTS OF AFFIRMATIVE ACTION IN THE UNIVERSITIES 

The costs of affirmative action have been measured in different ways. 31 
Relatively little of this commentary has involved the costs to whites. 
There are such costs—some number of white students are denied places 
at universities they could otherwise have won, because of affirmative 
action. 32 But most of the concem about affirmative action eomes down 
to this question: How much harm is done to minority self-esteem, to 
white perceptions of minorities, and ultimately to ethnic relations by a 
System that puts academically less able minority students side by side 
with students who are more able? There are no hard-and-fast answers, 
but at least we can discuss the magnitude of the problem from the stu- 
dent s eye view and from the vantage point of the general population. 

The Student’s Eye View of Minority and White Cognitive Ability 

Getting to know students from different backgrounds is a proper part of 
a College education. But given the differences in the cognitive abilities 
of the students in different groups, diversity has other consequences. To 
the extent that the groups have different scores, both perceptions and 
grades will track with them. Consider once again the probability of 


reaching College for students at different levels of cognitive ability. 
Comparatively small proportions of students with low intelligence get 
to College, no matter what their race. But the Student on the ground 
does not see the entire population of students with IQs in the bottom 
quartile (let us say). Rather, the only people in the bottom quartile 
whom he sees are the ones who reached College. 

To see just how different these perspectives can be, let us take first 
the extreme “above the battle” view of racial tensions that might be 
caused by affirmative action. The argument goes as follows: 

Yes, there is a racial discrepancy in test scores, though one should in - 
terpret those differences cautiously no matter what the evidence on cultural 
bias may be. But in reality we are talking about small numbers and small 
differences. In the NLSY data , blacks in the bottom quartile of cognitive 
ability who reach four-year Colleges amount to less than 4 percent of the 
youths on those campuses, while whites amount to almost 2 percent . Can 
anyone seriously think that this trivial difference can be a major problem? 

The answer seems as if it is self-evidently no. But now we switch to 
the view from ground level: from the vantage point of the College 
Student who attends classes, listens to fellow students talk in dass, 
observes what is going on in the library and the labs, and gossips with 
friends about other students. Let us imagine three observations of the 
kind that students commonly make in the normal course of Campus life: 
the racial mix of the entire Student population, the students who stand 
out because they seem to be espeeially out of place in a university, and 
the students who stand out because they seem to be espeeially smart. 

We will operationalize this student’s campus view by looking at the 
NLSY subjects who attended a four-year university (excluding histori- 
cally black schools), focusing on those with IQs that put them in the 
top and bottom 10 percent of such students. The figure below displays 
what our hypothetical Student sees. It shows students by IQ, but a fig¬ 
ure that contained the same breakdown by College grades (unavailable 
in the NLSY) would show roughly the same pattern. Backed up by the 
many studies that have examined the relationship between cognitive 
test scores (espeeially SAT scores) and performance in College: Cogni¬ 
tive test scores generally overpredict College grade point average (GPA) 
for both blacks and Latinos, in comparison to whites . 33 If anything, a 
figure showing students with the top and bottom 10 percent of GPAs 
would show an even greater ethnic discrepancy in College performance 
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The student’s eye view of cognitive ability 
Ethnie Composition of the Student Body on an Average Campus 
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between whites and blacks or Latinos than the discrepancy in IQs in- 
dicares. 1341 Similarly, the data from individual Colleges that t.pened the 
chapter suggest that this aggregate national picture would look no bet- 
ter, and might well look worse, in a school-by-school porrrait. 

Such large differences in performance are obvious to all, including 
other students. The problem, and a major cost of affirmative action, is 
that while blacks in the NLSY constituted only 12 percent ofthose who 
went to College, they were 52 perccnt of the students in the bottom 10 
percent in cognitive ability and an almost invisibly small Proportion of 
the top 10 percent. The Statistical difference that was trivial in the view 
from above the battle has become a large racial discrepancy at ground 
level. Meanwhile the imbalance between Latinos’ representation in the 
campus population and in the bottom 10 percent of intelligence is less 
obvious, while the “other” category (a combination of Asians, Pacific 
ethnic groups, and American Indians) is proportionately represented in 
the top and bottom (as a conglomerate—if we split them up, most of 
those in the top are Asian). We suggest that the figure presented above 
is important in trying to understand some of the most difficult racial 
Problems besetting Americas universities. 

Racial Animosity. Racial clashes on campuses began to surtace in the 
early 1980s and apparently have been growing since then, with the great 
bulk of the difficulties between whites and blacks. 55 A plausible expla¬ 


nation is that whites resent blacks, who are in facr. getting a large edge 
in the admissions process and often in scholarship assistance and many 
of whom, as whites look around their own campus and others, u dorOt 
belong there” academically. Some whites begin to act out these resent- 
inents. Blacks perceive the same disproportions and resentments, then 
conclude that rhe College environment is hostile to them. 

We will not pursue this line of argument. Rat her, we refer our read- 
ers ro a growing literature by black scholars who have couched it in the 
conrext of their own experience.lt is plain that affirmative action fos- 
ters differences in the distribution of academic ability across races in the 
communiries on College campuses. Students are not imagining these dif- 
ferences. 

Black Dropout Raths. The high black dropout rates from College are 
also easier ro understand in the light of the figure above. Typically, the 
black dropout rate from universities in the last decade has run at about 
twice the white rate. 1 w) This was also true of rhe NLSY. Of all those who 
ever entered a tour-year institution, 63 percent of whites had gotten a 
bachelor’s degree by 1990 (when the youngesr reached 26) compared to 
only 34 percent of blacks. But the discrepancy is not myst.erious. The 
first and dominant explanation of higher black dropout rates is cogni¬ 
tive ability. Controlling for age and IQ, the black and white dropout 
rates converge. Given the average IQ of those who entered tour-year in- 
sritutions (about 110), the expected probability that a youth entering a 
four-year College would graduate was 59 percent for blacks and 61 per¬ 
cent for whites, a trivial difference. 

But whereas cognitive ability explains most of the difference in 
dropout rates, it may nur explain everything. In particular, the NLSY 
data reflect the overall experience of blacks and whites, ignoring the 
experience at specific Colleges as we descrihed it earlier. Let us consider 
MIT, for vvhich dropout rates hy race have also heen reported. In 1985, 
the average SAT-Math score for a black male accepted at MIT was 
659, a score that put him above the 90th percentile of all students 
taking the SAT hut he low the 2 5 th centile of all students at MITT The 
dropout rare for hlack students at MIT in the mid-1980s was 24 percent, 
compared to 14 percent for whites. 40 Even if the average MIT hlack 
freshman in 1985 could indeed do the work there in some objective 
sense, getting discouraged abour one’s capacity to compete in an envi¬ 
ronment may be another cost of affirmative action, a phenomenon that 
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has beert described anecdotally by a number of observers, black and 
white alike. 41 

The Population 's Eye View of People with College Degrees 

The other vantage point to take into account is the view of the public 
toward minority and white College graduates. The College degree—what 
it is and where you got it—packs a lot of information in today’s Amen 
ica, not just as a credential that employers evaluate in hiring but as a 
broad social signal. One may lament this (people ought to be judged on 
their own merits, not by where they went to school), but it also has a 
positive side. Historically, that little sentence, “I have a [solid degree] 
from [a welhregarded university],” jolted you loose from any number of 
stereotypes that the person you encountered might have had of you, 
The reason it did so was that a welhregarded College had a certain set 
of Standards, and its graduates presumably met those Standards, No mat¬ 
ter what one’s view is of “credentialing” in theory, the greatest benefi- 
ciaries of credentialing are those who are subject to negative stereotypes. 
One of the great losses of preferential affirmative action has been to di- 
lute the effects of the university credential for some minorities. Today 
the same degree from the same university is perceived differently if you 
have a black face or a white one. This is not a misguided prejudice that 
will be changed if only people are given more accurate information 
about how affirmative action really works. On the contrary, more accu¬ 
rate information about how affirmative action really works confirms 
such perceptions. 

This unhappy reality is unnecessary. There is no reason that minor¬ 
ity graduates from any given College have to be any different from white 
College graduates in their ability or accomplishments, Restoring the 
value of the credential is easy: Use uniform procedures for selecting, 
grading, and granting degrees to undergraduates. Some difference in the 
cognitive distributions among College graduates would still remain, be- 
cause even if individual schools were to treat applicants and students 
without regard to race, we could expect some cognitive difference in the 
national distributions of graduates (since a group with disproportion- 
ately fewer high-scoring students would probably gravitate to less com- 
petitive schools; they would graduate, but nonetheless have lower mean 
ability). But within schools, the group differences could be as close to 
zero as the Institution chooses to get. America’s universities are instead 
perpetuating in the ranks of their graduates the same gap in cognitive 


ability that separates blacks and Latinos from whites in the general pop- 
ulation. As we saw in the data on law and medical schools, there is no 
reason to think that the gap shrinks as people move further up the ed- 
ucational ladder, and some reason to think it continues to grow. 

Some will argue the gap in ability is an acceptable price to pay for 
the other good things that are supposed to be accomplished by aggres¬ 
sive affirmative action. Our judgment, in contrast, is that in trying to 
build a society where ethnicity no longer matters in the important 
events in life, it is crucially important that society’s prestigious labels 
have the same or as close to the same meaning as possible for different 
ethnic groups. In the case of one of these key labels—the educational 
degree—policymakers, aided and abetted by the universities, have pre- 
vented this from happening. 

We will trace some of the consequences in the next chapter, when 
we turn to affirmative action in the workplace and present at more 
length our assessment of how the double Standard embedded in affir¬ 
mative action affects society, For now, we will observe only that the seeds 
of the consequences in the workplace and beyond are sown in Colleges 
and universities. To anticipate our larger conclusion, affirmative action 
as it is being practiced is a grave error. 

A POLICY AGENDA 

We urge that affirmative action in the universities be radically modi- 
fied, returning to the original conception. Universities should cast a 
wide net in seeking applicants, making special efforts to seek talent 
wherever it lives—in the black South Bronx, Latino Los Angeles, and 
white Appalachia alike. In the case of two candidates who are fairly 
closely matched otherwise, universities should give the nod to the ap- 
plicant from the disadvantaged background. This original sense of af¬ 
firmative action seems to us to have been not only reasonable and fair 
but wise. 

What does “closely matched” mean in terms of test scores? We have 
no firm rules, but as a guideline, admissions officers might aim for an ad- 
missions policy such that no identifiable group (such as a racial minor¬ 
ity) has a mean that is more than half a Standard deviation below the 
rest of the Student body. [42] This guideline is by no means demanding. 
In effect, it asks only that the average minority Student is at the 30th 
centile of the white distribution. Perhaps experience would prove that 
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this is not closely matched enough. But at least let us move toward that 
Standard and see how it works. The present Situation, with black stu- 
dents averaging well over a full Standard deviation below the white 
mean, sometimes approaching two Standard deviations, is so far out of 
line with any plausible rationale that universities today cannot publish 
the data on their admitted students and hope to persuade the public (or 
specialists in education) that their policies are reasonable. 

Would an end to aggressive affirmative action mean that minorities 
who can profit from a genuine College education will find the door of 
opportunity closed to them? There is no reason to think so. On the con- 
trary, we urge that people examine more closely an ignored, brief era in 
American university life—from the mid-1950s to the mid-1960s. Sn 
multaneously, the civil rights movement was gaining momentum, white 
upper-middle-class America was having its consciousness raised on the 
subject of racial discrimination, and color-blindness was actively taken 
as the ideal. At many Colleges during that era, applicants were forbid- 
den to enclose a photograph and instructed to avoid any information in 
the essay that might help identify their race or religion. Whether ad- 
missions committees were truly innocent of this information is another 
question, but the intent was clear, and so was the result: Racial differ- 
ences in qualifications during that time were minor, or so it appeared to 
both of us at the time. 

What were campus race relations like then? What were the attitudes 
of the black students toward achievement? What was the performance 
of black students relative to the predictions that might have been made 
based on their high school performance? What were the dropout rates 
of blacks relative to whites in the same institution? What were the sub- 
sequent careers of black students from that era? How do black students 
from that era, looking back, assess the pluses and minuses of the current 
state of affairs versus their experience? 

We must put such topics as questions because that era has been ig¬ 
nored. We suggest this possibility: American universities once ap- 
proached the ideal in their handling of race on the campus, and there 
is no reason why they could not do so again. 

Fewer blacks would be at Berkeley or Yale if there were no affirma¬ 
tive action. But admitting half as many black students to Yale does not 
mean that the rejected ones will not go to College; it just means that 
they will not go to Yale. For some individuals who are not chosen, this 
will be a loss, for others a blessing, but it is a far different choice from 


“college” versus “no College.” It is not even clear how much the goals of 
diversity would be adversely affected for the System as a whole. If affir¬ 
mative action in its present form were ended, the schools at the very 
top would have smaller numbers of blacks and some other minorities on 
their campuses, but many other schools in the next echelons would add 
those students, even as they lost some of their former students to schools 
further down the line. And at every level of school, the gap in cogni- 
tive ability between minorities and whites would shrink. 

Ending affirmative action as it is currently practiced will surely have 
other effects. Affirmative action does in fact bring a significant number 
of minority students onto campuses who would not otherwise be there. 
Perhaps the Overall percentage of some minorities who attend College 
would drop. But their white counterparts at the same level of ability and 
similar socioeconomic background are not in College now. To what ex¬ 
tern is a society fair when people of similar ability and background are 
treated as differently as they are now? In 1964, the answer would have 
been unambiguous: Such a society is manifestly unfair. The logic was 
right then, and right now. 
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Affirmative Action in the 
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Employers want to hire che best workers; employment tests are one of the best 
and cheapest selection tuols at their disposal. Since affirmative action began 
in the early 1960s, and especially since a landmark decisum by the Supreme 
Court in 1971, employers have been tightly constrained in the use they may 
make of tests. The most common solution is jor employers to use them but to 
hire enough protected minorities to protect themselves from prosecution and 
lawsuits under the job discrimination rules. 

The rules that constrain employers were developed by Congress and the 
Supreme Court hased on the assumptions that tests of general cognidve abil- 
ity are not a good way of picking employees, that the best tests are ones that 
measure specific job skills, that tests are biased against Blocks and other mo 
n<>ritie\s, and that all groups have equal distributions of cognidve ability. These 
assumptions are. empirically incorrect. Paradoxically , job hiring and promo- 
tion procedures that are truly fair and unbiased will produce the racial dis - 
parides that public policy trie :s to prevent. 

Have the job discrimination regulations worked? The scholarly consensus 
is that they had some impact, on some kinds of jobs, in some settings , during 
the 1 960s and into the 1970s, but have not had the decisive impact that is 
commonly asserted in political rhetoric. It also appears } however , that since 
the early 1960s Blacks have been overrepresented in white collar and Profes¬ 
sional occupations relative to the number of candidates in the IQ ränge from 
which these jobs are usually filled , suggesting that the efjects of affirmative ac¬ 
tion policy may be greater than usually thought. 

The successes of affirmative action have been much more cxtemivelji stud - 
ied than the costs. One of the. most urulerstudied areas ofthis topic is job per- 
formance. The scattered data suggest that aggressive affirmative action does 
produce large racial discrepancies in job performance in a given workplace . It 
is time that this important area be explored systematically . 
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In coming to grips with policy, afew hard truths have to be accepted. First , 
there are no good ways to implement current joh discrimination Imv without 
incurring costs in economic efficienc y andfairness to both employers and em- 
ployees. Second , after Controlling for IQ, it is hard to demonstrate that the 
United States still suffers from a majtrr Problem of racial discrimination in oc- 
cupations and pay. 

As we did for affirmative action in higher education, we present the case 
for retuming to the original conceptüm of affirmative action. This means 
scrapping the existing edifice ofjob discrimination law. We think the benefits 
to productivity and tofaimess ofending the anttdiscrimination laws are s ub- 
stantial. But our larger reason is that this nation does not have the Option of 
ethnic balkanization. 


A ffirmative action in the workplace arose at the same time thar it 
did in the universities but with important diffcrences. One differ- 
ence is that in the workplace, the government and the courts have heen 
the main activists, forcing businesses into a variety of involuntary prac- 
tices, whereas universities and Colleges largely create their own policies 
regarding Student selection. Affirmative action policies in the work- 
place have been more a matter of evolution than of coherent policy- 
making. (Appendix 7 traces this evolution.) Universities and Colleges 
occasionally run afoul of affirmative action laws in their hiring and pro- 
motion decisions, but in Student admissions they are usually far ahead 
of what has been legal ly required of them. 

A second important difference is that almost everyone has a personal 
stäke, and can see what is going on, in the workplace, unlike on Cam¬ 
pus. In Colleges, the applicant who does not get in hecause he was dis- 
placed hy an affirmative action admission never knows exactly why he 
was rejected. ln many workplaces, individuals can identify others who 
are hired, fired, and promored under the aegis of affirmative action, and 
they tend to have strong opinions about the merits of each case. In many 
workplaces, affirmative action decisions regarding a few people can af- 
fect the daily life of tens or hundreds of people who work with them and 
under them. College and university admission decisions have less obvi- 
ous immediate effects. These may he some of the reasons that few, if any, 
points of friction in American society have been ruhbed so raw as where 
affirmative action operates in the workplace. The topic inflames rela- 
rions hetween white elites (who generally favor the policies) and white 


workers (many of whom feel victimized by them), between ethnic 
groups, hetween the sexes, and between many citizens and their gov¬ 
ernment. 

The chapter is organized around several factual questions regarding 
affirmative action in the workplace. We Start with the facts hecause they 
are pivotal to the arguments about affirmative action yet are often over- 
looked or misconstrued. First, what are America^ affirmative action 
policies? Second, do they make sense, given the relevant data? Third, 
what difference have they made? After reviewing the data on these is- 
sues, we turn to some hroader questions that the facts raise but cannot 
altogether resolve. How should we think about the economic costs of 
affirmative action in the workplace? Assuming that just, about everyone 
wants employment to be fair, what should “faimess” mean in the labor 
market? 

Throughout, we concentrate on the Situation regarding blacks. Af¬ 
firmative action has expanded to embrace many other groups, but this 
policy came about hecause of an urgently feit national desire to redress 
the plight of blacks, and the focal point of tension, intellectual and so¬ 
cial, has been affirmative action for blacks ever since. Many of the points 
we make ahout that story apply with modifications to other groups as 
well. Our policy recommendations also apply generally. 

THE FEDERAL GOVERNMENTS REQUIREMENTS FOR 
AFFIRMATIVE ACTION IN THE WORKPLACE 

People apply for jobs. The employer hi res some and not others. Later rhe 
employer promotes some and not others. An employer who appears to 
have based hiring or promotion decisions on the person s being white (or 
one of the other outlawed reasons) is in violation of the law. A pure heart 
and good faith are not enough. If a rejected applicant or an unpromoted 
employee brings a complaint, an employer must he ahle to prove that the 
hiring and promotion processes meet legal definitions of faimess. 

For some posirions, employers may post job requirements anddemon- 
strate that the hired or promoted employees had rhe best qualifications. 
But many jobs do not lend themselves to such case-by-case selection. 
In these cases, how does the employer demonstrate that the chosen em¬ 
ployees have been selected without illegal discrimination?The ohvious 
answer (or so it seemed in the heginning) is to use an objective joh test 
and hire applicants with the highest scores. Testing has therefore heen 
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at the center of the history of employment discrimination law, as it has 
played out from the Civil Rights Act of 1964 to the Civil Rights Act of 
1991. Here are some features of the prevailing Situation facing cmploy- 
ers, with variations and an interlude descnbed in the appendix, since 
the Supreme Courts landmark Griggs v. Duke Power Co. decision in 
1971: 

If an employer uses a test in the employment process and the results 
of that test lead to different results for different protected groups (mainly 
hlacks, Latinos, and women) that employer faces the prospect of law- 
suits, fines, and damages that could cost the Company millions—per- 
haps tens of millions—of dollars. Employers can protect themselves in 
three ways. 

First, they may decline to use tests. Nevertheless, they will still he 
vulnerable if their alternative hiring process bas disparate impact (the 
legal phrase) on the hiring of different groups. 

Second, they can try to construct a test that has an urgent economic 
justification and a manifest, direct relationship with the skills required 
hy the job. A general ability test is always unacceptablc. Usually off-the- 
shelf tests of any kind will also be found unacceptable until they are vah 
idated for the particular job in question 

Third, an employer may meet the 80 percent rule. Creared as part 
of federal guidelines issued in 1978, the 80 percent rule says in effect 
that people in the protected groups have to be hired or promoted at 80 
percent or more of the rate enjoyed by the group with the highest rate 
of success in being hired or promoted. Here is how it works in pracrice: 
Stippose that the Acme Corporation uses a test for all its job applh 
cants. Let us say that 225 white males apply and 90 are hired. This hir- 
ing rate of 40 percent is the benchmark against which the hiring of 
other groups is measured. All other groups must be hired at a rate no 
lower than 80 percent of the 40 percent hiring rate of white males, 
which comes to 32 percent. If 150 white women apply and 50 are 
hired 33 percent—Acme meets rhe hiring rate for women. Suppose 
that 100 Latinos apply and 25 are hired. Now Acme is vulnerable to 
discrimination suits by the rejected Latino applicants because its 
hiring rate for Latinos is 25 percent, not 32 percent. Ir should hire at 
least seven more Latinos, bringing the Latino percentage up to the 
needed 32. [t| 

Note that we have said nothing about how the test was used or even 
what the comparative scores were. With the 80 percent rule, those con- 


siderations are irrelevant. It makes no difference if the rejected male ap- 
plicants had scores that were twice those of the successful women ap¬ 
plicants: All that matters is the hottom line: the 80 percent criterion. 
Less than 80 percent, and Acme is in trouble; more than 80 percent, 
and the government will probably leave Acme alone. Just “probably,” 
however. The 80 percent rule is a guideline, not a law, and there is no 
guarantee that meeting it will head off litigation. 1 “ 1 

SOME FALSE FACTUAL ASSUMPTIONS BEHIND EMPLOYMENT 
TESTING POLICY 

Federal affirmative action policy toward employment testing is laden 
with assumptions not about fairness but about what is true as a factual 
matter. Speeifically, Congress and the Supreme Court developed fed¬ 
eral job discrimination policy on the assumptions that (1) tests of gen¬ 
eral cognirive ability are not a good way of pick ing employees, (2) the 
best tests are ones that measure specific job skills, (3) tests are biased 
against hlacks and other minorities, and (4) all groups have equal dis- 
tributions of cognirive ability. 

To varying degrees, tbese assumptions were defensible vvhen they 
were first voiced in the 1960s. Ethnie differences in test scores were 
known to exist, hut many experts at: that time still thought they reflected 
test hias, or that the differences would melt away as educational oppor- 
tuniry for minorities improved. The predictive validity of tests for job 
performance was poorly understood. But however understandable these 
vievvs were in the 1960s, public policy over the next twenry years suf- 
fered from an increasingly severe case of psychometric lag. To summa- 
rize the by-now solidly established empirical Situation described in 
Chapters 3 and 1 3: 

• Cognitive ability has an economically important relationship to 
job productivity that applies across the ränge of jobs and the ränge 
of abilities. 

• Cognitive ability tests are often the single most predictive method 
of picking employees—more predictive than grades, education, or 
a job interview. 

• The predictive power of tests derives almost completely from their 
measure of general cognitive ability, not measures of job-specitic 
skills. 
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• Cognitive ability tests either are not biased against blacks as pre- 
dictors of job performance, or in some cases are biased in favor of 
blacks. 

• Different ethnic groups have substantially different distributions 
of cognitive ability that are not explainable by eultural bias and 
not easily altered by remedial steps. 

What is true regarding jobs, IQ, and group differenc.es in cognitive 
ability is the opposite of what the courts, the Congress, and many orh- 
ers have supposed the truth to be. The dilemma is that job hiring and 
promotion procedures that are truly fair and unbiased in the sense in 
which everyone used those terms in 1964 will produce the ethnic and 
group disparities that public policy so vigorously tries to prevent. The 
most valid hiring tests may have the largest disparafe impacr. As a first 
step in coming to terms with affirmative action—however one balances 
the many other factors that make affirmative action desirahle or unde- 
sirable—the govemmentshould scrap the invalid scientific assumprions 
that undergird policy and express policy in terms that are empirically 
defensible. 

This step need not mean scrapping affirmative action. It means 
only discarding rhetoric about testing and affirmative action (“tests 
aren’t valid for minorities,” “tests of general ability don’t predict any- 
thing worth knowing about job performance”) that are not true 
and instead defending affirmative action on whatever grounds can 
be authentically defended. Some progress has been made on this front. 
The Hartigan Committees report on the General Aptitude Test 
Battery was a step in the right direction, for example, acknow- 
ledging many of the key facts about tests while continuing to defend 
affirmative action (though the basis for their defense is in itself open 
to technical debate). A few other proponents of strong affirmative 
action are becoming more forthright about what they are real ly 
promoring—not just equal opportunity but equal employment out- 
comes despite unequal job performance. 4 But these are exceptions to a 
general public discussion of affirmative action that relies on inaccurate 
and to some degree dishonest representations of the state of knowledge 
about tests, employment, and competition among protected and un- 
protected groups. 


HAS AFFIRMATIVE ACTION WORKED? 

The scholarly debate over the effects of antidiscrimination legislation 
in the workplace has been lively, and this is a good time to summarize 
where that debate Stands. The answers are complicated, but scholars 
have done much better than the public commentators on this score. 

Version I: Ignoring Cognitive Ability 

According to official statistics, wages for blacks have risen since the 
1960s and more blacks have entered prestigious occupations. Most peo- 
ple rake for granted that these changes have happened to some impor¬ 
tant degree because of antidiscrimination laws. But what may seem 
obvious at first glance is not obvious upon further inspection. “Two 
decades of research have failed to produce Professional consensus on the 
comrihution of federal government civil rights activity to the economic 
progress of black Americans,” vvrote economists James Heck man and 
Brook Payner in 1989Q and the Situation has clarified only marginally 
since then. The nature of the problem facing the analysts is illustrated 
by the figure below for two categories of white-collar jobs that affinem- 

The uncertain effects of affirmative action in the workplace 

Pereentage of employed blacks 

_ Origgs 

1964 Civil decision Uniform 



S ourccs: Bureau ot Lahor Staristics 1983, 1989; U.S. Department ot Labor 1991. Figures prior 
ro 1973, reporred tor “blacks and orhers,” are adjusted pro-rata ro the black-only populär ion. 
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tive action was supposed to open up for blacks. N The vertical lines de- 
marcate three landmarks in antidiscrimination law: rhe passage ot the 
Civil Rights Act of 1964 that outlawed job discrimination, the Urigg.s 
decision that put increased pressure on employers to hire the right num- 
her of minorities even if they were using consistent hiring pracrices, and 
adoption of the Uniform Guidelines on Employee Selecrion Procedures 
that established the 80 percent guideline (all described further in Ap- 
pendix 7). 

To see why the analysts have a complicated task, consider clerical 
jobs (the gray line in the figure). The srory here seems ohvious: From 
1959 until the passage of the Civil Rights Act, improvement was slow. 
Immediately after the act came a sudden increase in the pereentage of 
employed blacks who held clerical jobs; thereafter the pereentage con- 
tinued rising but at a slower rate. Furthermore, the gap between black 
and white percentages for these jobs (not shown in this graph) also 
closed—again, faster for a while after 1964 than before. We might con- 
clude that the Civil Rights Act itself was effective but that the two suh- 
sequent landmarks in affirmative action policy were not, at least for 
these jobs. 

Now follow the black line in the above figure, representing Profes¬ 
sional and technical jobs. Its slope before 1964 was certainly no lower 
than its slope after; if anything, the slope decreased alter the act. Blacks 
were making progress before the act; afterward they weren’t progressing 
any faster in their movement: into these high-status, higlvpaymg occu- 
pations. Trendlines for other job categories, not shown in the graph, 
that were supposed to open up for blacks—managerial and administra¬ 
tive, sales, and craft workers—similarly fail to regisrer much ot a gain 
from the new policies. The clerical job category is the unusual case; it 
is the only joh category that shows a visible change in slope after 1964- 
lf evidence of success is to be found for affirmative action, it niusi be 
disentangled from a web of other factors that seem to have been intlu- 
encing the employrnent of blacks. 171 

This is not to say that antidiscrimination law had no effect, only that 
the effects on hiring and promotion are not simply demonstrated. Our 
understanding of the impact of affirmative action policies, drawn from 
a numher of technical assessments that have not taken cognitive ahil- 
ity into account, may be summarized as follows:* 


• Affirmative action policies had the expected effect in public bu- 
reaucracies. Police and firefighters are the most conspicuous ex- 
amples, but affirmative action also has clemonstrably increased the 
Proportion of minorities throughout government bureaucracies, 
from the federal level on down/ 1 At the federal levei, the strongest 
effects are at the clerical levei and below. In cities with large mi- 
nority populations, the effects are spreacl across a broader ränge of 
government. positions, with de facto quotas up to the highest lev- 
eis. 

• Among private Companies, affirmative action has had some 
effects, particularly in the South and among Companies that do 
business with the federal government. Some unknown fracrion 
of the increase in black employrnent by Companies with govern¬ 
ment contracts is balanced oft by compensating declines in Com¬ 
panies wirhout them. 

• ln private industry in the South (where much of the most demon- 
strable progress in private industry has been made), a complicated 
mix of forces seems to have been at work: partly the Civil Rights 
Act of 1964 and its aftermath, partly the repeal of Jim Crow laws 
restricting job entry into certain industries, partly a broader break- 
down of racial Segregation, legal and otherwise. 10 

• Whatever effects affirmative action may have had during rhe 
1960s and 1970s, they had become too small to measure by the 
1980s and will probahly conrinue to be small in the future, largely 
for economic rcasons. 

• The behavior of employers has certainly been affected hy job dis¬ 
crimination law. Every large Company must maintain a hureau- 
cracy to monitor compliance with federal regulations and to 
defend against (or, commonly, settle our of court) lawsuits alleg¬ 
ing discrimination. The amounts of time, money, and resourcesde- 
vored to compliance are substantial. 

In short, federal antidiscrimination efforts writ large—embracing all 
the disparate events following on the rise of the civil rights movement, 
in the mid-1950s—probably had a significant impact on black economic 
progress. Job discrimination law in particular probably had a smatler but 
significant effect for some blacks in some settings. No serious Student of 
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the topic argues that jobdiscrimination law had the decisive impact that 
is commonly attributed to it in political rhetoric. 

Version II: When Cognitive A bility Is Tal<en into Account 

We now pose a question of affirmative action that has not been asked 
in the literature we just reviewed: How do the observed differences be- 
tween blacks and whites in occupations and wages compare to those 
that would he predicted from the observed black-white difference in the 
distribution of cognitive ability? We presented the summary answer as 
of the end of the 1980s in Chapter 14, when we showed that, after Con¬ 
trolling for IQ, a higher proportion of blacks than whites in the NLSY 
are in the professions and that wages for blacks and whites are essen- 
tially equal. Neither education nor socioeconomic hackground, ac- 
counted as well as IQ for the differences in jobs or wages between blacks 
and whites. 

These findings may bear on the question of the impact of affirmative 
action in the workplace. To see why, let us examine the mean IQs for 
NLSY members in different job categories as of 1990, as shown in the 
table helow. In all job categories, from highest to lowest in skill, em- 
ployers are hiring blacks who differ from whites in those jobs by one or 
more Standard deviations in IQ. Part of the reason may be that env 
ployers hire blacks and whites of differing cognitive ability hecause of 


The Black-White IQ Difference by Job Category, 1990 


Job Category 

Mean White IQ 

Black-White Difference, 
in Standard Deviations 

Profess ions 

114 

1.3 

Managerial 

108 

1.1 

Technical 

113 

1.5 

Sales 

106 

1.4 

Clerical 

104 

1.1 

Protective Services 

103 

1.4 

Other Service jobs 

97 

1.4 

Craft 

99 

1.1 

Low-skill labor 

96 

1.1 


the pressures brought on them by government policies regarding the rep- 
resenration of minority groups. Without such pressures and in a race- 
blind labor market, blacks and whites should be equal in those traits that 
best predict performance on the job. From the kinds ot data reviewed 
irr Chapter 8, we know that cognitive ability is such a trait—the more 
so, the greater the skills are invol ved in the job. Consequently, we should 
expecr the IQ gap between whites and blacks to be the narrowest for 
high-skUl jobs i( hiring is race blind. 

We may draw this conclusion without knowing whether an employer 
administers cognitive tests to job candidates or even thinks consciously 
about cognitive ability when hiring. The relationship of cognitive abil¬ 
ity to job product iviry exists independent of the exisrence of test scores, 
and all hiring practices that succeed in choosing productive workers will 
tend to select employees with only small group differences in in teil i- 
gence for occupations in which IQ is most important. The table above 
shows no such narrowing for the cognitively demanding jobs. If any- 
thing the gap widens tovvard the top of the table. 

The most plausible explanation for the large gap tovvard the top of 
the table is that employers are using dual Standards for black and white 
job applieants. Moreover, we venture the hypothesis that employers are 
using dual Standards at least in parr hecause someone or something (the 
government or an aVersion to harmful puhlicity) is making them do so— 
hence our conclusion rhar affirmative action is probably having a more 
subsrantial impact on hiring practices than the Standard analyses indi- 
cate. 

This also leads to a reinterpretation of the graph on page 485 for 
clerical and Professional and technical jobs. We pointed out that the 
trendlines for black employees did not get steeper, with the single ex- 
cept ion of clerical jobs, after the Civil Rights Act was passed. Now we 
are suggesting an alternative perspective: The fact that the trendlines 
continued to go up as long as they did is in itself evidence of the impact 
of affirmative action. Without affirmative action, the trendlines would 
have leveled off sooner, perhaps at the point at which blacks and whites 
of equal IQ had equal chances of employment in high-status jobs. In the 
next figure, we adjust the hiring proportions for the known difference 
in IQ between whites and blacks. 111 ' For Professional and technical jobs, 
the assumption Ls that employees are normal ly drawn from people with 
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IQs of 98 or higher; for clerical jobs, the assumption Ls that they are 
drawn from withm the ränge of 86 to 123. 1121 The results are shown in 
the figure below. 

A revised view of equal employment opportunity after 
correcting for ethnic differences in the IQ distributions 

Black/white ratio (l=equality) a 

1964 Civil Griggs Uniform Professional* 



1991. 


■' 1 he rarin represenrs hlack.s employed in a ^iven occuparinnal ^rtnipiny ex- 
presset! as a pereema^e ot eli^ible blacks, divided hy the whites empluyed in 
the samt oceuparional groupin^ expressed as a percenta^e ot eli^ihlc whites. 
The number of eligibles is determined hy the size of the working-age popula- 
tion in rhar rate who fall within the IQ ränge for rhat oeeuparion, as ealeu- 
lated trom a rahle ot normal probabilities. The assumptions for eompurmg 
the ratio are: (J) the IQ ränge for Professional and technical jobs is 98 and 
higher; (2) the IQ ränge for clerical johs is 86-123; (3) [Q is normal ly dis- 
rrihured with a mean of 85 for blacks and 100 for nonblacks, with a Standard 
deviation of 15 for both groups. 


What“shouLT’ the lines look Uke? If the assumptions indrawing them 
were accurate, then both lines should have risen to 1 (to signity rhat 
blacks and whites in the same IQ ränge are hired at the same rate) ah 
ter the antidiscrimination laws were passed and then hovered near 1 
thereafter. Anything ahove 1.0 signifies a higher likelihood for blacks 


of being hired, once IQ is held constant; below 1.0, the opposite is true. 
The proportion of blacks in Professional and technical jobs rose above 
1 in the early 1960s, flattened after the Civil Rights Act of 1964, took 
another steep jump after Griggs, and then settled into a gradual rise 
through the late 1980s. For clerical johs, progress after 1964 led to pan 
ity in the late 1960s. The relative proportion of blacks in clerical jobs 
then continued to incrcase at a slower but more nearly linear pace since 
then. ln both categories of employment, blacks have been hired at 
higher rates than whites of equal IQ since the late 1960s, and the up- 
ward trend lasted at least until the late 1980s. 

Since rhese job categories do not have precisely defined IQ ranges, 
it may be asked what would happen if the assumptions were changed. 
Some of the alternatives we tried are described in the note to this Para¬ 
graph. The short answer is that the picture stays essentially the same 
within any reasonable ränge of assumptions. The overall conclusion is 
that blacks have for some years had more people working in both cler¬ 
ical jobs and Professional and technical jobs than would ordinarily be 
expected, given the IQ ränge from which those jobs are usually filled. |n| 

The figure above uses broad guidelines ahout the IQ ränge from which 
certain johs are held and applies vhem to national data abour occupa- 
tions. For a narrower focus, the NLSY supplies data about specific indi- 
viduals, their occupations, and IQs. [M1 In 1990, using the same detinirion 
of “professional and technical occupations,” and after Controlling for IQ 
(set at 11 3, the mean IQ for whites in such occupations), the Propor¬ 
tion of blacks in the NLSY employed in Professional and technical oc¬ 
cupations was 1.5 times the proportion for whites, compared to the ratio 
of 1.7 shown for 1990 in the graph. For clerical johs, after Controlling 
for age and IQ (wirb IQ set at 103, the mean value for whites holding 
clerical jobs), a black in the NLSY was 1.9 times more likely than a 
white to he employed in a clerical job, compared to the figure of 1.6 for 
1990 as shown in the graph. lis ' The conclusion drawn from national sta- 
tistics Ls thus confirmed by the individual data in the NLSY. 

Several points may be drawn from this exercise. First, it highlights 
the reality and magnitude of the discrimination suffered by blacks prior 
to the civil rights movement. As recently as 1959, the employment of 
blacks in clerical and Professional and technical johs was only halt the 
proportion that would have been expected from recruitment to those 
jobs based on IQ alone. Decennial census data (not to mention living 
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memory) rell us that this undcrrepresentation was still more severe in 
the 1950s and 1940s. 16 There was a clear and large racial deficit to he 
made up. 

Second, the exercise shows how rapid ly changes were made in the 
1960s and early 1970s. If cognitive ahility is raken into account, the um 
derrepresentation of hlacks in professional and technical johs was gone 
hy 1964, prior to the Civil Rights Act. This closing ot the occupational 
gap between hlacks and whites, obscured hy trendlines that do not com- 
pensate for IQ differences, argues that something besides antidiscrimi- 
nation legislation was already atoot in America, making the joh market 
less stacked against hlacks. 

Third, hy the end of the 1960s, the joh market had pressed heyond 
the point of parity for hlacks and whites, again alter cognitive ahility is 
taken into account. One might argue that this merely proves that IQ is 
not so important for joh productivity alter all—except that a large lit- 
erature, already summarized, demonstrates heyond rnuch douht that IQ 
is as predictive of joh performance for hlacks as for whites. 11 We can only 
surmise that the reason for attaining such high levels of black repre- 
sentation, particularly in the occupations that most strongly correlate 
with IQ, incIndes the impact of affirmative aotion policies. To that ex¬ 
tern, if these affirmative action policies were ehanged, black employ- 
ment in these occupations would fall. Would this he a return to 
unfairness? We will retum to this hard question alter considering the 
costs of affirmative action for joh performance. 

THE COSTS OF AFFIRMATIVE ACTION: JOB PERFORMANCE 

Inasmuch as cognitive ahility is related to joh performance and as mi- 
nority workers are entering professions with lower ahility distrihutions 
than whites, is there evidence of lower average performance for minor- 
ity workers than for whites? Of all the many kinds of double-speak as- 
sociated with affirmative action, this question points to (me of the most 
egregious. Private complaints about the incompetent affirmative-action 
hirce are much more common than scholarly examination of the issue. 
We may nonetheless present several cases bearing on joh performance, 
all teiling similar stories for different occupations, using different kinds 
of data. 


Teacher Competency Examinations 

The nationwide enthusiasm for teacher competency examinations in 
the 1980s resulted in teacher testing programs in virtual ly all States hy 
the end of the decadeT These competency tests are seldom job perfor¬ 
mance tests as such, but rather a test of hasic knowledge of reading, wrir- 
ing, and mathematics. Even so, teachers who score higher on the tests 
have greacer success with their studentsC The competency exams seem 
to have had some general ly heneficial effects, though the cutoffs are low 
hy the usual Standards of what we expect teachers to knowd 0 The pass 
rates for whites typically exceed 80 percent and sometimes 90 percent. 
Whatever your profession may he, think about the meaning of a test 
that would “pass” aspirants to the profession w ; ho perform in the bot- 
tom 20 percent. Bur having so low' a cutoff for whites sharpens the ev- 
idence of the di spar ity in black and white qua lificat ions, as shown in 
the following tahle. 


Typical Results of State 
Teacher Competency Examinations 



Pass Rate 

Implied 


Whites 

Blacks 

Difference in SDs a 

California, 1983-1991 

80% 

35% 

1.2 

Pennsylvania, 1989 

03 

68 

1.0 

New York, 1987 

83 

56 

1.3 

Georgia, 1978-1986 

87 

40 

1.4 


Sourcv.s: H. C "ollins, “Minority groups are still lagging on reacher exam,” 
Phikidelpfua Jnquirer, Aug. *5, 1^89, p, Bl; T. Spottord, “Teacher fest called 
hiased,” A/Kmv 7imes Cniun, Nov. 20, 087, p. Al; R. Pavila, “States 
teacher lest biased against minorities, lawsuir contends,” Saemment/) Bee, 
Sepr. 24. N92, p. B8; "Minoriry teachers,” Kiehmond News Leader, May 16, 
IW, p. AH. 

' Assumes a normal disrrihurion and equal srandard deviations in horh 
«roups. 
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These are not cognitive ability scores or scores that are being usecl to 
select people for further education hut the scores achieved hy people 
who are heading into the nation’s classrooms. According to the insti- 
tutions thathavegraduated theseapplicantsfor teacher certification (in 
some cases, the scores are for teachers already on the job), all of them 
have met the requirements for a College degree, and they presumably 
can read, write, and do hasic math. The scores are on tests that make 
no pretense to seek excellence but to weed out the most obviously un- 
suitedJ 211 With differences ranging upwards of 1 Standard deviarion, the 
inescapable conclusion is that a large gap separates black and white 
teachers in basic skills. 1 ^ 1 


The Compensating Skills Fallacy 


One of the mosr common arguments ahour the current practice of affir¬ 
mative action might be called the compensating skills fallacy. It is com- 
monly applied to any profession ander discussion, but teachers provide an 
especially good example. The argument goes like this: 

T/utc are many skills and cfualities that go into being a good teacher hesides 
test scores. The ability to inspire amfidence, to create an eagemess to learn, to 
listen to children are all pari of the unde repertoire of skills that ßf mw being a 
good teacher that have nothing to do with the traits measured by a coynmivc abib 
ity or acadcmic skills te.st. 

The Statement itself is correet. Most professions involve a numher of 
important noninteliectual at tri Hutes. The fallacy lies in assuming tliat peo¬ 
ple who have lower cognitive lest scores will, on average, be hetrer en- 
dowed in these other areas than people with higher scores. 

Suppose rhat the teacher comperency exams consisted of several parts, 
each of vvhich measured one of these noninteliectual skills, h would he 
possihle to detend hiring teachers wirh marginal grades on the intellecfual 
skills if these teachers were hi red from the top of the list on the tests of the 
other qualities. But the way affirmative action progranis actually work, 
rhese other qualities are not tesred or compared. The minority candidate 
with the best score on the tesr of intellectual qualities is selected. As for 
the other qualities, not measured by the test, there is no reason to assume 
that they are any higher than average. 1 " 


A Joumahst’s Account of the Washington , D.C., Police Force 

Because affirmative action has been practiced most aggressively in pub¬ 
lic employment—police, firefighters, social welfare agencies, depart- 
ments of motor vehicles, and the like—they are logical places to look 
if indeed job performance has been compromised. 24 The Washington, 
D.C., Police Department is a case in point, as described by journalist 
Tucker CarlsonT 

In the mich 1970s, the Washington, D.C., Police Department 
insrallecl a residency requirement for police. Washingtons white popu- 
lation is densely concentrated among whitc-collar and Professional 
groups, with no significant white working-class neighborhoods. The res- 
idency requirement thereby severely rest.ricted the pool of potential 
white applicants. By 1982, 40 percent of the candidates who took the 
police admissions test failed it, and the department was having a harcl 
time filling positions. A new test was introduced in 1985, normed to fa- 
vor minority applicants. Standards in the police academy were lowered 
to the point at which not one Student flunked out of the training course 
in 1983 (despite the lower cognitive ability of the candidates being ad- 
niitred). In 1988, the academy abolished its final comprehensive pen- 
cihand-paper examinarion after 40 percent of graduating recruits failed 
it. The formet head of the Fraternal Order of Police and a Veteran of 
twenty-two years on the force reported that, at ahout that time, he he- 
gan Hearing “ahout people ar the academy who coulcl not read or 
writc\” 2( ' A former academy instructor says that “I saw people who were 
practically illiterate. l’ve seen people diagnosed as borderline retarded 
graduate from the police academy.”" 7 

This degradation of intellectual requirements translates into police 
performance on the Street. For example, the paperwork that follows an 
arrest has been a bane of police everywhete for many years, but when 
police can do the work, it is mainly an inconvenience, not a barrier. An 
officer who cannot do the paperwork or who finds that it pushes the lim- 
its of his abilities may forgo making arrests in marginal cases. The ar- 
rests that are made are often botched. Between 1986 and 1990, about a 
third of all the murder cases brought to the U.S. attorneys office in the 
District were dismissed, historically an unusually high rate, often he- 
cause the prosecutors were unable to make sense of the arrest reports. 
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The Basic features of Carlson’s account are confirmed by a variety of 
other joumalistic accounts, most conspicuously a 1993 investigative se- 
ries by the Washington Post on police performance. 2h Two facts about the 
Washington Police Department seem clear: Recruitment and training 
Standards deteriorated markedly in recent decades, and the performance 
of the department, once considered a national model, has also deterio¬ 
rated badly. 

Washington is not unique. In Miami in 1985, the police department 
was rocked by the discovery and seizure of hundreds of pounds of co¬ 
caine hidden by police officers working in cahoots with smugglers. We 
have the results of the intense self-examination that resulted. The main 
conclusion was that this crime, as well as the many others that were 
straining comniunity-police relations at the time, could be traced in part 
tt^> the relaxation of hiring Standards mandated by affirmative action reg- 
ulations. Almost 90 percent of the officers who were clismissed or sus- 
pended within a few years of the Initiation of aggressive affirmative 
action policiesat the beginning of the 1980s were officers with marginal 
qualifications, hired because of those policies. 20 

Such stories are common among people who have worked in, or been 
a dient of, organizations that practice aggressive affirmative action, and 
the link they ascribe to affirmative action is usually explicit and em- 
phatic. |i0] There is a great deal of smoke emanating from such accounts. 
We urge that people Start checking out whether there is any fire. 

A Scholarly Analysis ofan Affirmative Action Program 
for Blue'Collar Jobs 

Econoniist Eugene Silberberg systematically compared the experience 
of blacks who were admitted to craft unions (electricians, plumbers, and 
pipefitters) in Seattle at the end of the 1970s under a court order and 
whites who were admitted under ordinary selection procedures at the 
same time.' 1 Silberberg assembled data on performance in apprentice 
school, on-rhe-job ratings, and educational Background, then was given 
access to a variety of job performance measures over an eighteen-month 
follow-up period: hours worked, numher of employees who quit, jobs 
tumed down, failures to respond to a clispatch, and being listed by an 
employer as not eligible for rehire. The table below shows the combined 
differences, expressed in Standard deviations, for the pipefitters and 
plumbers. 


Job Performance of Black Affirmative Action Plumbers 
and Pipefitters Compared to White Regular Hirees 

Black-White Difference in SDs 


Job perjorrmmce measures 


Quits or no rehire 

+.6 

Termination for cause 

+.5 

Nonresponse to job call 

+.6 

Hours worked 

-.9 

IQ-related measures 


GPA in apprentice school 

-1.3 

GPA in on-the-job training 

-.8 


Source: SillxTherg 1985, Table 2. 

Note: The table coirtbines data on appreutices and journeyman for both 
erafts usinLj wei^hred srandard deviations. 


Comparing the blacks admitted under the court order with whites 
admitted under the ordinary procedures at the same time, the blacks 
quit at more than six times the rate for whites, were terminated for cause 
at more than three times the rate for whites, and did not respond to a 
job dispatch at more than six times the rate for whites. Similar results 
were obtained for the electricians. The results track closely with the 
larger literature on IQ and job productivity. The differences in the job 
performance measures are what might be expected from the discussion 
in Chapter 3. Furthermore, the size of the difference in job performance 
is economically important. Silberberg discusses the possibility that the 
differences are themselves a result of bias among the dispatchers and Su¬ 
pervisors. Given the procedures for assigning jobs in the Seattle unions, 
he concludes that it is extremely difficult to explain away the differences 
in such terms. 1 ' 21 

Having reviewed the less than plentiful data at hand about ethnic dif¬ 
ferences in job performance, we are reminded of a passage by Andrew 
Hacker, one of the stoutly “pro” voices in the affirmative action debate: 

A favorite question of affirmative action’s opponents is whether 
you would want to be operated on by a surgeon who had been ad¬ 
mitted to medical school under a racial dispensation. As it happens, 
few posing this kind of question have any knowledge of what makes 
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for surgical skill. In fact, there are no known correlarions between 
good grades or high scores and subsequent success with a scalpel. If 
we mean to debate this subject seriously, we should rely on hard data 
rather than scare tactics.* 3 

We cannot agree with Hackers characterization ot the state ofknowl- 
edge, but we enthusiastically subscribe to bis concludingsentence. By all 
means, let people on all sides of this issue assemble hard data. The pur- 
pose of the foregoing examples is to make two poinrs: (1) the scattered 
evidence ahout job performance and affirmative action—indirect and 
direct, soft and hard—suggests large and pervasive effects, and (2) there 
is no excuse for not having many more hard-data studies of the type that 
Silberberg conducted. Job performance is important, ir is mensurable, 
and the issue of affirmative action and its effects on job performance has 
been on many people’s minds for years. Many corporations routineiy 
conduct studies of job performance and have databases that could be re- 
analyzed to assess the effects of affirmative action on job performance, 
The request we make of Hacker and other proponents of aff irmative 
action is that they join us in encouraging such work. Confident that 
groupdifferences in job performance are not an important problem, they 
can try to prove their case. Our own conclusion is that they cannot dt) 
so. If this is so, the debate about affirmative action must shift to another 
level: How much degradation of job performance is acceprable in pur- 
snit of the other goals of affirmative action? And that in turn brings 
us to first questions. What, after all, is the nation trying to accomplish 
with affirmative action in the workplace? What are the righr measures 
of success? 

A POLICY AGENDA 

In thinking about affirmative action in the workplace, more than psy- 
chometric realities or efficiency in the workplace must be considered. 
To avoid misunderstanding, this is a good time to lay out our perspec¬ 
tive on these other matters. 

• As of the 1950s, minorities, especially blacks, in many parts of the 
country were systematically and unjustly excluded from entering 
skiüed and Professional occupations of all kinds. 

• At least since the 1950s, changes in white attitudes, as expressed 
in the civil rights movement and in myriad other events in race 


relations, the removal of Jim Crow restrictions in the South, and 
affirmative action requirements opened up opportunities for mi¬ 
norities. Progress was made. 

• ln the 1990s, racial hostility continues to be a significant problem 
in American life. 

• Affirmative action has an internally consistent rationale even if it 
is at odds with the maximum efficiency in hiring productive work- 
ers. 

This last remark calls for some elahoration. Suppose, for the sake of 
argument., that we are sure that a history of unfair discrimination has 
handicapped some people so that they fare less well in the job market 
than they otherwise would. Their handicaps may handicap their de- 
scendants, so that past Unfairness is propagated mdefinitely into the 
future, unless we do something about it. A properly const.ructed affir¬ 
mative action policy may then be temporarily less efficient but more ef- 
ficient in the long run. If iL achieves long-run efficiency by breaking the 
cycle of past discrimination, it is arguahly fair. And even if the long run 
is indefinitely far off, many people are willing to pay some price in lost 
productivity for a large enough gain in group equality. 

Or suppose that we knew that the inequality in employment that we 
observe arises for reasons we consider inherently unfair. Perhaps blacks 
are, for example, not being hired to be shop clerks in neighhorhoods be- 
cause the customers (or the other workers) are bigoted. 1 ^ 1 It may be ef¬ 
ficient to hire fewer clerks who will be discriminated against, but it. is 
not fair. Many people would be willing, again, to lose some efficiency in 
return for grearer equality. 

In short, we sympathize with some of the imaginable reasons for af¬ 
firmative action in the workplace and are under no illusions about the 
ways in which perceptions of racial differences still affect employers’ hir¬ 
ing decisions. But affirmative action does not mean just wanting good 
rhings. It means specific and often substantial constraints on the em- 
ployer’s ability to make use of the most qualified people. What should 
we make of such policies as of the 1990s? 

Trying to Reconäle Ethnie Equity and Competitive Fairness 

It is possible for an advocate of current affirmative action policies to 
concede all the factual points we have made in this discussion and still 
be in favor of continuing and even stronger affirmative action policies. 
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For such advocates, it makes no difference if rhe tests are reliable and 
valid predictors of job performance. lf a disadvantaged group pertorms 
at a lower level, ro these advocares, it is self-evidently society’s fault, 
and government must take whatever Steps are necessary to bring the Jis- 
advantaged group up to the level of other groups, ensuring equal cur 
ployment and income in the meantime. Sometimes this argument is 
couched specifically in terms of the black experience in the United 
States, sometimes as part of a hroader argument for an egalitarian 
agenda. iS 

Our dispute with the egalitarian position has to be carried out onetlv 
ical and philosophical grounds, for there is nothing much to argue about 
in the facts. Briefly, we differ with the Contemporary advocates of coiv 
tinued quotalike hiring requirements on two counrs. 

First, we adhere to the 1964 view of what constitutes fairness, exene 
plified hy Hubert Humphrey, who, in fighting for passage of the Civil 
Rights Act of 1964, declared thar it “does not 1 imit the employer s free- 
dom to hire, fire, promote, or demote for any reason-—or for no rea- 
sons—so long as his action is not based on race,” and then volunteered 
to eat the Hill in. public if he were wrong about ivhar the new law would 
dod 6 Like the Senator, we reject equality of outcome as an appropriate 
goal. Equality of opportunity is the test most consistent with the Vision 
of rhe Congress that enacred the law in 1964, and for that matter with 
rhe vision that animated the Constitution. The appropriate goal is a job 
market in which people are not favored or held back simply because of 
their race. Nothing in nature or knowledge, however, says that al I groups 
should be equally successful in every walk of life. This may be “unfair” 
in the same sense that life is unfair, but it need not mean that human 
heings are treating one another unfair[y. 

Consider the convenient and appropriate case of athletic perfor- 
mance. By the Standard of proportional equality, there are u too many” 
black players in the National Basketball Association compared to the 
number of white players. No one thinks this is unjust. When profes- 
sional tennis equalized the purses for male and women champions, it did 
not also require the men and women to play against other, because 
everyone recognized that all the top men would almost always beat all 
of the top women. lf men and women players were ranked in a single 
list , would there be “too many” males among the top 100 tennis players 
in the world ? Any particular disproportion inay be unfair, but it may not. 
It may be less obvious why there are disproportions in other pursuits, 


hence harder to teil whether they are fair, but the principle is the same, 
and simple: If the quäl ity of performance fair ly differs among individu- 
als, ir may fairly differ among groups. [3?l lf a disproportion is fair, then 
“correcting” it—making it proportional—-may produce unfairness along 
with equal representation. We believe that is what has happened in the 
case of current forms of affirmative action. People who bring equal quab 
ificarions to a job should have an equal shot at heing hired, and affir¬ 
mative action regulations, originally intended to promote precisely that 
goal, now impede it. 

Second, the debate will he healthier if those who want private busi- 
nesses to support social objectives openly acknowledge thar such Sup¬ 
port does in fact entail costs in efficiency and productivity, hence the 
henefits that flow from greater efficiency and higher productivity—in- 
cluding a stronger economy for American society as a whole. 1381 Nor are 
the costs in productivity unique to private businesses. When a police 
department hires people who become less effective police officers than 
those it could have hired, the department loses some of its capahility to 
provide law enforcetnent. Affirmative action can cost something in gov- 
ernment Services every bit as much as in the productivity of a private 
husiness. 

We do not require equal outcomes, but we du want fair treatment. 
What policy alternatives mighr be employed to bring ahout this state 
(Taffairs in hiring and promotion? Before exploring four alternatives, 
[et us say clearly that the worst alternative, the one we do not discuss 
further, is what we are now doing: not raising the question at all and 
proceeding as if there are easy and costless ways to achieving fairness. 

Alternative 1: C reating Tests That Are Legal Under the Current 
Requirements 

In rheory, employers could construct job-specific tests that meet the 
Supreme Courts (and now the Congress’s) definition of fairness. It 
would be expensive, and the tests would seldom (if ever) hc more pre- 
dictive than a general test of cognitive ability. But it is feasible. The dif- 
ficulty is that predictiveness comes primarily from the tests’ measure ot 
g. Therefore, although they cannot be faulted under the other legal re- 
quirements, they will nonetheless he thrown out because of disparate 
impact. This is what has happened most famously at New York City’s 
Police Department, which for more than a decade has been spending 
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large amounts of money trying to create a sergeant’s examination. Each 
successive Version has met strict Standards of job specificity and free- 
dom from demonstrable cultural bias, but large ethnic dispariries have 
persisted. The disparities themselves invaiidate the test, and a new Ver¬ 
sion must be prepared. The police department has even used a video- 
based test, on grounds that any form of paper-and-pencil fest must 
necessarily discriminate against minorities. 

The case of the New York Police Department is one example of 
many. 40 In practice, no test that produces disparate results has been able 
to withstand challenge. The lesson of the last two decades is that eth¬ 
nic bias in a job test need not be proved. It need only be alleged. This 
has been most consistently the case for public employment—police, 
firefighters, sanitation workers, teachers, administrative staff—where 
political constituencies can most easily bring pressure to hear. 

Alternative II: Choosing Among Applicants with Equal Education 

ürdinarily a fair way to ease the existing affirmative acrion requiremcnt 
would be to permir employers to narrow the pool of qualified applicants 
by using education as a screen. Thus, for example, the 80 percent rule 
(see the definirion on page 482) could be calculated on the basis of ap- 
plicants who met a minimum educational level, not all applicants. But 
affirmative action at the university level (Chapter 19) prevents this So¬ 
lution from working, because the same degree may not have the saine 
meaning for blacks, Latinos, and whites in terms of cognitive ability. Wo 
showed this for the bachelors degree in the preceding chapter. Rur ein- 
ployers who try to make finer discriminations are no Fetter off. ln the 
NLSY, the black-white differences for every educational level, from 
high school diploma to Ph.D, are large, wirh the smallest being a dif- 
ference of 1.2 Standard deviarions.' 411 

Nor does it help to differentiate by major area of study. In the NLSY, 
a black and a white with a bachelors degree in engineering, math, or a 
hard Science—majors that would apparently be least susceptible to dou¬ 
ble Standards—were nonetheless separated by 1.1 Standard deviarions 
in IQ. Differences for other common majors (behavioral and social Sci¬ 
ences, fine arts, education, or business) ranged from 1.4 to 1.6 Standard 
deviations. For Latinos, the gap was smallest for engineering, math, or 
a hard science (.7 Standard deviarion) and ranged from .9 to 1.3 Stan¬ 
dard deviations for the others. 


The educational credential used to be an effective way for a person 
from a deprived background to stand on an equal footing with other job 
applicants. It is still so treated that way in political rhetoric. The real- 
ity facing employers is that, given the aggressive affirmative action that 
universities have employed over the last three decades, educational cre- 
dentials can no longer he used to compare the intellectual qualifications 
of black, Latino, and white job candidates. 

Alternative III: Rcu:e Nomiing 

An employer who hires large numbers of people cannot very well get 
along without using a test, but at the same time probably cannot devise 
a test that will pass muster with the government. So it will have to test 
applicants knowing that the test will produce unacceptably large group 
differences betvveen whites and blacks, then comply with the 80 per¬ 
cent rule by hiring additional applicants from the protected minorities. 

The simplest way to do this is to employ a pass-fail cutoff. Everyone 
above the cutoff is deemed qualified for the job, and then the employer 
uses other methods to choose among the candidates, making sure that 
the end result meets the 80 percent rule. This is a common solution 
and requires only that the cutoff be low enough that a sufficient num- 
ber of protected candidates get into the final group of candidates, 1421 
But the pass-fail cutoff throws away a great deal of valuable Informa¬ 
tion. Suppose that after complying with the 80 percent rule, the em¬ 
ployer ends up with six new white employees out of twenty whites who 
applied and two out of seven black applicants. Why just take any six 
whites who scored above the cutoff? Why not instead take the whites 
with the top six scores? Similarly, why not take the top-scoring two 
blacks? 

This is called top-down hiring. If the test has high validity, if the 
group differences are large, and if there are many applicants, it is much 
more efficient than a cutoff. 45 But there is a difficulty with this method. 
By deciding in advance on the number of whites and blacks who will be 
hired and then picking the top-scoring candidates, the employer is us¬ 
ing quotas, which is illegal (even before the 1991 Civil Rights Act, an 
employer who used explicit quotas was vulnerable to legal action). 

One way to get around this difficulty is to use race norming. The 
raw scores are converted into percentiles based on the distribution of 
scores within each group: a white applicant receives a percentile score 
based on the distribution of white scores; a black applicants score rep- 
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resents his percentile within che black distribution; and so on. Then 
the employer makes hiring decisions on the basis of these race-normed 
percenciles. Starting in che lace 1970s, the U.S. Department of Labor 
began promocing this solution, offering such race-normed scores for 
the General Aptitude Test Bactery (the GATB, described in Chap- 
cer 3). 44 

By the early 1980s, race norming had became a common solution to 
the employer’s dilemma. To see how race norming works, we may use 
the example of the populär Wonderlic PersonneI Test, a highly g-loaded 
paper-and-pencil test that takes just twelve minutes. In its rest manual 
in use during the 1980s, the Wonderlic Company gave precise in- 
structions for what it called “percentile selection”—its term for race 
norming—along with an “Ethnie Conversion Table.“ Suppose that five 
candidates—white, black, Latino, Asian, and American Indian— 
all got the Wonderlic’s mean score of 22 prior to any adjustment for 
group distributions. Using the Ethnie Conversion Table, the personneI 
office would then assign those five candidates, all of whom had identi- 
cal scores, to the 45th percentile (for the white), 80th percentile (for 
the black), 75th percentile (for the Latino), 55th percentile (for the 
Asian), and 60th percentile (for the American Indian), and those scores 
would thereafrer be treated as the “real“ scores. 14 ^ An employer could 
then hire from the rop down using these adjusted scores and expect to 
end up with ratios of employees that would avoid triggering the Uni¬ 
form üuidelines. 

In 1986, the U.S. Department of Justice challenged race norming on 
the grounds that it was an unlawful and unconstitutional violation of 
the rights of people who were neither black nor Latino. In our exam- 
ple, a black with a score of 80 would indeed have a much better chance 
of being hired than a white with a score of 45, though hoth had the same 
score on an unbiased, valid test. The Departments of Justice and Lahor 
adjudicated their differences, agreeing to study the method further. Race 
norming had few defenders in public, where its Unfairness seemed pal- 
pable. In the Givil Rights Act of 1991, race norming was banned for any 
employer subject to federal regulation. For now, this experiment in af¬ 
firmative action policy— ironically, by far the most efficient from a pro- 
ductivity standpoint and even the “fairest,” insofaras the highesr scorers 
at least won out in competition with members of their own group—has 
been suspended. 


Alternative IV: Retuming to the Original Conception of Affirmative Ac¬ 
tion 

We are dissatisfied with all of the foregoing alternatives and are broadly 
critical of the way in vvhich the well-intentioned effort ro end employ- 
ment discriminarion has played out. We therefore dose by urging con- 
sideration of this proposition: I ftomorrow alljob discrimination regulations 
based on group proportions were rescindcd, the United States would have a 
job market that is ethically fairer , more conducive to racial harmony , and eco¬ 
nomic ally more productive, than the one we have now. We cannor prove 
that the proposition is true (just as no one can prove that it is not), but 
here are two reasons for taking it seriously. 

The first is public approval of the old concept of fairness. Preferen- 
tial affirmative action has been a favorite cause of intellecruals, jour- 
nalists, and liberal politicians, but it has never been rooted in broad 
public support. Instead, according topolls taken in the 1970s and 1980s, 
most Americans favor hiring by ability test scores over preferenrial hir¬ 
ing for protected groups. At the same time, they approve of having the 
governmenr offer a helping hand—for example, by offering free courses 
to people to help them do better on ability tests used for employment. 
A clear majority of blacks similarly favor ability test scores over prefer- 
ential hiring. 40 A re turn to policies based on evenhandedness for incli- 
viduals (not for groups) seems sure to attraef enthusiastic and broad 
public support. 

Th e second reason is the potential for good faith. Our fundamental 
recommendation for the workplace resembles the one we offered for 
higher education: get rid of preferential affirmative action and return to 
the original conception of casting a wider net and leaning over back¬ 
ward to make sure that all minority applicants have a fair shot at the job 
or the promotion. To the extenr that the government has a role to play, 
it is to cnsure equality of opportunity, not of outcome. Once again, we 
anticipate that the main objection will be that ending affirmative ac¬ 
tion as now practiced will take us back to the bad old days. As we come 
to the end of our long wrestle with the new American Dilemma known 
as affirmative action, let us expand on our reasons for our optimism that 
the United Stares can do without it very well. 

Try this thought experiment on yourself. If all anridiscrimination law 
were rescindcd tomorrow, would you (if you are an employer) hire whites 
in preference to blacks or Latinos? Would you (if you are an employee) 
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begin lookingfor workplaces where you did not have to work with Nacks 
or Latinos? Would you (if you are a customer) seek out Stores and Ser¬ 
vices that did not have black or Latino personnel? We put the issue that 
way to expose a stränge dissonance among Americans. We are confi- 
dent that the answer to all of those questions by virtually all of the white 
readers of this hook is an emphatic, deeply feit u no.” May we even sug- 
gest that many of you would feel much happier about what you were do- 
ing if, as an employer, you spent your time concentrating on whether a 
minority applicant was the right person for the job rather rhan worry- 
ing about whether the applicant was likely to sue you if you turned him 
down; that, as an employee, you would find it a blesscd re lief to work in 
an office with black or Latino colleagues where it could he taken for 
granted by everyone that the personne 1 office had hired all of you using 
the samt* yardstick; that, as a consumer of Services, you wish you could 
choose a surgeon who happens to be an ethnic minority, hecause you 
could be confident thar his clegree meant rhe same thing for everyone 
who received it. 

We have no doubt that all of the above Statements are true for the 
vast majority of our readers, and yet many people are convinced that 
the populär ion as a whole would take ad van tage of the Situation if af¬ 
firmative action were ended. Talk about it with your friends, and you 
will find it to be a commonplace not limited to yourselt. Although they 
too are authenrically committed to treating people fairly regardless of 
race, color, or creed, they worry that massive bigotry still exists and will 
bring back the had old days as soon as the heavy hand ot the goveriv 
ment regulation ts liftecl from them. Ry odd happenstance, the people 
one knows personally are much tnore fair-minded than the people one 
doesn’t know personally. 

Is this really true? That bigotry still exists is incontestable. Rur that 
Joes not mean that bigotry would prevail in the American joh market 
as of the end of the twentieth Century if the vast machinery of anridis- 
crimination law did not exist. Much of what we have presented in rhis 
chapter about occupational gains by blacks in the years hefore and af- 
ter 1964 suggests the opposite. The civil rights movement authenrically 
raised white awareness of the oppression and exploitation of blacks in 
the job market. The trendlines in hoth white hehavior and black out- 
comes began to move in the right direction, gathering speed. The civil 
rights legislation came alongat the same time and probably cweakeJ the 
slopes of those trendlines in some instances. Rut rhe great, truth abour 


the 1960s was not that the nation finally enacted the civil rights laws 
but that the American people were finally and inexorably moving in 
the right direction anyway. We are asking that you consider seriously 
the proposition that it is feasible to remove antidiscrimination law, re- 
placing it with vigorous enforcement of the time-honored American 
principle that all citizens are equal betöre the law. 

As in the case of College admissions, some economic and occupa- 
tional reshuffling would occur. Some minorities would fail to get jobs 
that they get now. If, for example, the Washington Police Department 
returns to a policy of hiring the best-qualified candidates, a smaller Pro¬ 
portion of those new police would be black. Whcrever eise Standards 
have been lowered to increase the number of minorities in a workplace, 
the numher of minorities in those positions in that workplace would 
probably diminish. On the other hand, the quality of the Washington 
police force is likely to improve, which will be of tangible benefit to the 
hundreds of thousands of blacks who live in that city. Minorities in all 
walks of life will have lifted from them the post-1964 form of second- 
class citizenship that affirmative action has imposed on them. 

Much of the reshuffling that may be expected will not be bad even 
for those who are reshuffled. As matters stand, newly hired minority ex- 
ecutives in corporations offen enjoy short-term benefits (higher pay and 
Status at the front end than new graduates could ordinarily expect) but 
a career dead end. Rlacks in Companies that do husiness with the fed¬ 
eral government are routinely used in highly visible positions as evi- 
dence of affirmative action compliance and diverted trom the niore 
pedestrian but ultimately inore heneticial apprenticeship positions that 
the white etnployees have no choice but to serve. Minority business- 
people are channeled into the minority set-aside game, learning how to 
serve as fronts for contracts thar are acrually carried out hy whites, in- 
stead of running the husiness itself. Affirmative action has deformer! 
many aspects of American life, not least in twisting the ways in which 
minorities must try to get ahead. 

We will not try to estimate what the effects of doing away with job 
discrimination legislation would be for husiness productivity. The ef¬ 
fects would vary widely by industry and location in any case, from triv¬ 
ial to substantial. Nor will we spend much time talking about rhe 
benefits for whites, except to say that these benefits should be. countcd. 
It is easy for highly educated whites with many options to look henignly 
on affirmative action. It has little effect on their joh prospects. For a 
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young white man with fewer advantages who has wanted ro be a fire- 
fighter all his life and is passed over in favor of a less-qualified minority 
or female candidate, the costs loom larger. To dismiss his disappoint- 
ment and the hardships worked on him just because his skin is white 
and his sex is male is a peculiarly common—and cruel—reaction of peo- 
ple who hurst with indignation at every other kind of injustice. 

Whatever their precise amounts, the benefits to productivity and to 
tairness of ending the antidiscrimination laws are substantial. But our 
largest reason for wanting to scrap job discrimination law is our belief 
that the System of affirmative action, in education and the workplace 
alike, is leaking a poison into the American soul. This nation does not 
have the Option of ethnic balkanization. The increasing proportions of 
ethnic minorities—Latino, East Asian, South Asian, African, East Eu¬ 
ropean—make it more imperative, not less, thar we return ro the rnelt- 
ing pot as metaphor and color blindness as the ideal. Individualism is 
not ortly America’s heritage. lt must be its future. 


Chapter 21 

The Way We Are Headed 


In this penultimate chapter we speculate about the irnpact of cognitive 
stratification on American life and Government. Predicting the course 
of society is chancy, but certain tendencies seem strong enough to worry 
about: 

• An increasingly isolated cognitive elite. 

• A merging of the cognitive elite with the affluent. 

• A deteriorating quality of life for peopie at the bottom end of the 
cognitive ahility distribution. 

Unchecked, these trends will iead the U.S. toward something resem- 
hling a caste society, with the underciass mired ever more firmly at the 
bottom and the cognitive elite ever more firmly anchored at the top, re- 
structuring the rules of society so that it becomes harder and harder for 
them to lose. Among the other casualties of this process would be Amer¬ 
ican civil society as we have known it. Like other apocalyptic visions, 
this one is pessimistic, perhaps too much so. On the other hand, there 
is much to be pessimistic about. 

REC APITUL ATION: THE IN VISIBLE MIGRATION 

As we described in Part I, the cognitive elite refers to peopie in the top 
percentiles of cognitive ahility who, over the course of the American 
twentieth Century, have been part of a vast but nearly invisible migra- 
tion. The migration does not reveal itself in masses of humanity Cross¬ 
ing frontiers but in countless bits of data about the movement of 
individuals across the levels of society. Like all other great migrations, 
this one too will transform both the place peopie left and the place 
they go. 

At the beginning of the Century, the great majority of peopie in the 
top 5 or 10 percent of the intelligente distribution were not College ed- 
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ucated, ofren not even high school educated, and they lived their lives 
scattered almost indistinguishably among the rest of the population. 
Their interests were just as variegated. Many were small Businessmen or 
farmers, sharing the political Outlook of those groups. Many worked on 
assembly lines or as skilled craftsmen. The top of the cognitive ability 
distTihution probably included leaders of the lahor movement and ot 
community organizations. Among the smart women, a few had Profes¬ 
sional careers of their own, but most of them kept house, reared chil¬ 
dren, and were ofren the organizing forces of their religious and social 
communities. 

People from the top of the cognitive ability distribution lived next 
door to people who were not so smart, with whose children their own 
children went to school. They socialized with, went to churcli with, and 
married people less bright than themselves as a matter of course. This 
was not an egalitarian utopia that we are trying to recall. On the con- 
trary, communities were stratified by wealth, religion, dass, ethnic Back¬ 
ground, and race. The stratifications may have been stark, even bitter, 
but people were not stratified by cognitive ability. 

As the Century progressed, the historical mix of inrellectual abilities 
at all levels of American society rhinned as inrelligence rose to the rop. 
The upper end of the cognitive ability distribution has been increas- 
ingly channeled into higher education, especially the rop Colleges and 
Professional schools, thence into high-lQ occupations and senior man¬ 
agerial positions, as Part 1 detailed. The upshot is that rhe scattered 
brightest of rhe early twentieth Century have congregated, forming a 
new dass. 

Membership in rhis new dass, the cognitive dite, is gained by high 
IQ; neither social Background, nor ethnicity, nor lack of money will bar 
the way. But once in the club, usually by age eighteen, members begin 
to share much eise as well. Among other things, they will come ro run 
much of the country’s Business. In the private sector, the cognitive dite 
dominates the ranks of CEOs and the top echelon of corporate execu- 
tives. Smart people have no doubt always had the advantage in com¬ 
merce and industry, but their advantage has grown as the Barriers againsr 
the “wrong” nationalities, ethnicities, religions, or socioeconomic ori- 
gins have been dismantled. Meanwhile, the leaders in medicine, law, 
Science, print journalism, television, the film and Publishing Industries, 
and the foundation world come largely from the cognitive elite. Almost 
all of the leading figures in academia are part of it. ln Washington, the 


top echdons of federal officialdom, special interest groups, think tanks, 
and the rest of Washingtons satellite institutions draw heavily from the 
cognitive elite. At the municipal level, the local husiness and political 
movers are often members of the cognitive elite. 

G1V1NG MER1TOCRACY ITS DUE 

Part 1 mostly described a success story—success for the people lucky 
enough ro he part of the cognitive dite but also a success for the nation 
as a whole. Before turning to the dark side, we should he explicit about 
the good things that flow from the invisible migration. 

Chief among them is the triumph of an American ideal. Americans 
bdieve that each person should he able to go as far as talent and hard 
work will take hitn, and much of what we have described is the realiza- 
tion of that conviction, for people with high IQs. The breadth of the 
dränge was nvade possible by twentieth-century technology, which ex- 
panded the need for people with high IQs by Orders of magnitude. But 
the process itself has been a classic example of people free to respond to 
opportunity and of an economic system that created opportunities in 
ahundance. 

Life has been increasingly good for the cognitive elite, as it has dis- 
placed the socioeconomic elires of earlier times. We showed in Part I 
the increasing financial rewards for brains, but money is only a part of 
the cornueopia. ln the far-from-idyllic past when most of the people at 
the top of the cognitive distribution were farmers, housewives, workers, 
and shop owners, many of them were also frustrated, aware that they 
had capahilities that were not heing used. The graph on page 56 that 
traced the steep rise in high-IQ jobs over the course of the Century was 
to soine important extent a picture of people moving from unsatisfying 
johs to lucrative and interesting ones. 

Technology has not just created more jobs for the cognitive elite but 
revolutionized the way they may he done. Modern transportation has ex- 
panded the realm in which people work. Beyond that, physical Separa¬ 
tion is becoming irrelevant. A scientist passionately devoted to the study 
of a certain protein or an Investment analyst following a market can he 
in daily electronic conversation with people throughout the world who 
share the same passion, passing drafts of work back and forth, calling up 
data files, doing analyses that would have required a mainframe Com¬ 
puter and a covey of assistants only a few years ago—all while sitting 
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alone at a Computer, which need not he in an office, but can as easily be 
in a beach house overlooking the ocean. Across the occupational do- 
main of those who werk primarily with their minds, the explosion of 
Computer and Communications technologies has liherated and ex- 
panded creativity, productivity, and personal freedom. The re may be 
some costs of this physicai Isolation, but many people are happiev and 
more fulfilled as a result of the reach of modern technology. 

For the nation as a whole, the invisible migration has surely hrought 
benefits as weil. We cannot measure the gains precisely, but they are the 
inevitable side effect of greater efficiency in identifying intellectual tah 
ent and channeling it into high-IQ occupations. Compared to 1900 or 
even 1950, America in the 1990s is getting more productivity out of its 
stock of human Capital, and this presumably translates into more johs, 
gains in GNP, and other effects that produce more wealth for the soci- 
ety at large. 

So what’s the problem?The old stratifications are tading, erased hy 
a greater reliance on what people often call ment. Millions of people 
have benefited from the changes—including us. Would we prefer less 
of a meritocracy? Put that way, no—but “no” for larger reasons as well. 
The invisible migration is in many ways an expression of what Amer¬ 
ica is all about. 

ISOLATION WITHIN THE COGNITIVE ELITE 

What worries us first about the emerging cognitive elite is its coales- 
cence into a dass that views American society increasingly through a 
lens of its own. In The End of Equality, which analyz.es the stratification 
of American society from a vantage point different from ours, social 
critic Mickey Kaus describes the isolation we have in mind. He identi- 
fies it hroadly with the decline of “the public sphere.” 1 The end of the 
military draft, the social Segregation of the school System, and the divi- 
sive effects of the underclass are among his suspects, and each has doubt- 
lessplaycd an important role independent (tosomedegree) of the effects 
of the cognitive stratification that we described in Part I. Thinking 
about the way these forces had affected his own life, Kaus remarked: “1 
entered a good Ivy League College in 1969. 1 doubt Pve had a friend or 
regulär social acquaintance since who scored less than an 1100 on his 
or her SAT boards.” 2 

Kaus is probably right. The reason why this is a problem is captured 


by a remark attributed to the New Yorker’s one-time movie critic Paul ine 
Kael following Richard Nixons landslide victory in the presidential 
electionof 1972: “Nixon can’t have won; noone I know votedforhim."* 
When the members of the cognitive elite (of whatever political con- 
victions) hang out with each other, often exclusively with each other, 
they find it hard to understand what ordinary people think. 

The problem is not simply that smart people rise to the top more eh 
ficiently these days. If the only quality that CEOs of major corporations 
and movie directors and the White House inner circle had in common 
were their raw intelligence, things would not he so much different now 
rhan they have always been, for to some degree the most successful have 
always been drawn disproportionately from the most intelligent. But the 
invisible migration of the twentieth Century has done much more than 
let the most intellectually able succeed more easily. It has also segre- 
gated them and socialized them. The members of the cognitive elite are 
likely to have gone to the same kinds of schools, live in similar neigh- 
borhoods, go to the same kinds of theaters and restaurants, read the same 
magazines and newspapers, watch the same television programs, even 
drive the same makes of cars. 

They also tend to be ignorant of the same things. They watch far less 
commercial television than the average American. Their movie-going 
rends to be highly selective. They seldom read the national tabloids 
that have the nation’s largest circulation figures or listen to the talk ra- 
dio that has hecome a major form of national communication for other 
parts of America. This does not mean that the cognitive elite spend 
their lives at the hallet and reading Proust. Theirs is not a high culture, 
but it is distinctive enough to set them off from the rest of the country 
in many important ways. 

The isolation of the cognitive elite is by no means complete, but the 
Statistical tendencies are strong, and the same advances in trans- 
portation and communication that are so enhancing the Professional 
lives of the cognitive elite will make their isolation from the rest of the 
puhlic that much greater. As their common ground with the rest of 
society decreases, their coalescence as a new dass increases. The tra- 
ditional separations between the Business world, the entertainment 
world, the university intellectuals, and govemment are being replaced 
by an axis of bright people that runs through society. They already sense 
their kinship across these spheres of Interest. This too will increase with 
time. 
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THE COALITION OF THE COGN1T1VE ELITE AND THE 
AFFLUENT 

The trends we have described would not constitute a threat to the re- 
puhlic if the government still played the same role in oivic life rhat it 
played through the Eisenhower administration. As recently as 1960, it 
did not make a lot of political difference what the cognirive elite 
thought, because its power to impose rhose values on the rest of Amer¬ 
ica was limited. In most of the matters that coiinted—the way the 
schools were run, keeping order in the public square, opening a business 
or running it—the nation remained decentralized. The still inchoate 
cognitive elite in 1960 may have had ideas about how it wanted to move 
the world hut, like Archimedes, it lacked a place to stand. 

We need not become embroiled here in a dehate about whether the 
centralization of authority since 1960 (or 1933, for those who take a 
longer view) was right or wrong. We may all agree as a Statement offact 
that such centralization occurred, through legislation, Supreme Court 
decisions, and accretions of executive authority in every domain of daily 
life. With it came something rhat did not exist before: a place for the 
cognitive elite to stand. With the end of the historic limirs on the fed- 
eral reach, everything was up for grabs. If one political group could get 
enough votes on the Supreme Court, it could move the Consritution 
toward its goals. If it could get enough votes in Congress, it could do 
similarly with legislation. 

Through the 1960s, 1970s, and 1980s, the hattle veered hack and 
f°rth, with groups identifiably “liberal” and “conservative” hloodying 
each other’s noses in accustomed ways. But in the Rush and Clinton ad- 
ministrations, the old lines began to blur. One may analyze these trends 
conventionally in terms of the evolution of party politics. The rise of 
the New Democrats and the breakup of the Reagan coalition are the 
convenrional way of looking at the evolution. We think something eise 
is happen ing as well, with potential dangers: the converging inte res ts of 
the cognitive elite with the larger population of affluent Americans. 

For most of the Century, intellectuals and the affluent have been an- 
tagonists. Intellectuals have been identified with the economic left and 
the cultural avant-garde, while the affluent have been identified with 
big business and cultural conservatism. These comfortahle categories 
have become muddled in recent years, as faculty at the top universities 
put together salaries, Consulting fees, Speeches, and royalties that gar- 


ner them six-figure incomes while the New York Review of Book :s shows 
up in the mailbox of young corporate lawyers. The very bright have be¬ 
come rnuch more wniformly affluent than they used to be while, at the 
same time, the universe of affluent people has become more densely pop- 
ulated by the very bright, as Part 1 described. Not surprisingly, the in- 
terests of affluence and the cognitive elite have begun to blend. 

This meid ing has its limirs, particularly when the affluent person is 
not part of the cognitive elite. The high-IQ Stanford professor with the 
best-selling book and the ordinary-IQ fellow who mukes the same in- 
come with his small chain of shoe Stores are hardly allies on everything. 
Bin in looking ahead to alliances and social trends, it is still useful to 
think in terms of their increasing commonalities because, as any good 
economist or politician will point out, there are theoretical interests and 
pracrical interests. The Stanford professor’s best-selling book may be a 
diatribe against the punitive criminal justice System, but that doesn’t 
mean rhat he doesn’t vote with his feet to move to a safe neighborhood. 
Or his book may be a withering attack on outdated family norms, but 
that doesn’t mean that he isn’t acting like an old-fashioned father in 
looking after the interests of his children—and if that mcans sending 
his children to a lily-white private school so that they get a good edu- 
cation, so be it. Meanwhile, the man with the chain of shoe Stores may 
be politically to the right of the Stanford professor, but he is looking for 
the same saie neighborhood and the same good schools for his children. 
And even if he is more likely to vote Republican than the professor, he 
Ls un like ly to be the rugged individual ist of yore. On the contrary, he is 
likely to have become quite comfortahle with the idea that government 
is there ro be used. He and the professor may not be so far apart at all 
on bow they want to live their own personal lives and how government 
might serve those joinr and important interests. 

Consider the sheer size of this emerging coalition and how quickly 
the affluent dass as a whole (not just the cognitive elite) is growing. 
What is “affluence”? The median answer in 1992 when the Roper Or¬ 
ganization asked people how much annual income they would need “to 
fulfill all your dreams” was $82,100, which indicates where affluence is 
thought to starr by most Americans/ For purposes of this exercise, we 
will define affluence as beginning at an annual family income of 
$100,000 in 1990 dollars, about three times the median family income. 
By that definition, more than one out of twenty American families is 
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affluent, roughly double what it was a decade earlierd Furthermore, this 
growth has accompanied stagnant real income for the average family. 
Here is the last of the many graphs we have asked you to examine in 
this book. ln some ways, it is more loaded with social implications than 
any that have come before. 

In the 1970s, economic growth began to enlarge the affluent dass 


Median family income Percentage of families with 

(bars) incomes over $100,000 (line) 

$40,000- - 6 



1950 1960 1970 1980 1990 


The shaded years are ones in which real per eapita 
GNP dropped. All tlgures are based on 1990 dollars. 

Soura'.s: Median family income: U.S. Bureau of the Census 1991, Tahle R-4, supplemented 
with U.S. Bureau of the Census 1993, Tahle B-11. For families with incomes over $ 100,000, 
data from 1967-1990 are taken from U.S. Bureau of the Census 1991, Tahle B-3; U.S. Bu¬ 
reau of rhe Census, 1993, Tahle B-6. Figures for 1947-1964 are esumated from U.S. Bureau 
of the Census 1975, Series U 269-282, adjusred for differentes in definition of rhe family. 

The graph illustrates the reason for the intense recent interest in 
American income inequality. From the end of World War II until the 
early 1970s, average family income rose. Then in 1973, median family 
income Kit a peak. Part of the reason for the subsequent lack of progress 
has been the declining real wages for many categories of blue-collar johs, 
described in Chapter 4. Part of the reason has been the decline in rwo- 
parent families (economic progress continued, though modestly, for 
families consisting of married couples). In any case, the average Amer¬ 
ican family has been stuck at about the same place economically for 
more rhan twenty years. 
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For the affluent, the story diverges sharply. Until the early 1970s, the 
Proportion of families with $100,000 in 1990 purchasing power in- 
creased slowly and in tandem with the growth in median family income, 
But after progress for the average family stalled, it continued for the af¬ 
fluent. The steepest gains occurred during the 1980s, and Ronald Rea¬ 
gans policies ot the 1980s are commonly thought to he an important 
force (in praise or blame) for increasing rhe number of affluent. But 
economists know that there is a difficulty with this explanation, as you 
will see when you compare the 1970s with the 1980s. The rising Pro¬ 
portion of families with incomes of more than $100,000 sinee the early 
1970s does not seem ro he a fu net ion of any particular political party or 
policy, exeept insofar as those policies encourage an expanding econ- 
(any. It has gone with gains in real per eapita GNP (indieated by the 
unshaded hars in the graphic) whether those gains occurred under 
Richard Nixon, Jimmy Charter, Ronald Reagan, or George Bush/ 61 There 
is no reason to think that this trend will he much different under Bill 
Clinton or his successors, if the economy grows. The net result is that 
rhe affluent will constitute a major portion of the population in the rel- 
atively near luture, and they will increasingly be constituted of the most 
talented. 

Try to envision what will happen when 10 or 20 percent of the pop¬ 
ulation has enough income to bypass the social institutions they don’t 
like in ways that only the top 1 percent used to he able to do. Robert 
Reich has called it the “secession of the successful.”' The current Sym¬ 
bol of this phenomenon is the gated Community, secure behind its walls 
and guard posts, but many other signs are visible. The fax, modern, and 
Federal Express have already made the U.S. Postal Service nearly irrel¬ 
evant io the way that the affluent comnumicate, for example. A more 
portentous development is the private court System that businesses are 
beginning ro create. Or the mass exodus from public schools among 
those living in eitles, if they can afford it. Or the proliferation of private 
security forces for Companies, apartmenr houses, schools, malls, and any- 
where eise where people with money want to be safe. 

Try to envision what will happen to the political process. Even as of 
the early 1990s, the affluent dass is no longer a thin layer of rieh peo¬ 
ple but a political bloc to be reckoned with, Speaking in round num- 
hers (for the precise definitions of both groups are arbitrary), a coalition 
of the cognitive elite and the affluent dass now represents something 
well in excess of 5 percent of families and, because of their much higher 
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than average voting rates, somewhere in the vicinity of 10 to 15 per- 
eent of the vorerst The political clout of this group extends weil be- 
yond its rnere voting size because of its financial contributions to 
campaigns and because this group contributes a large proportion of lo¬ 
cal political Organizers. The combined weight of the cognitive eilte and 
the affluent is already considerable. But we asked you to envision to- 
morrow, not today. Do you think that the rieh in America already have 
too much power? Or do you think the intellectuals already have too 
much power? We are suggesting that a “yes” to hoth questions is prob- 
ably right. And if you think the power of these groups is too great now, 
just watch what happens as their outlooks and interests converge. 

Cynical readers will be asking what eise is new. The priviieged have 
always used the law to their advantage. Our own analysis is hardly novel; 
it is taken straight from a book of essays written more than two centuries 
ago, The Fecleralist. People are not naturally angelic but self-interested— 
eise, as Publius pointed out, governments would not be necessary in the 
first place. Politically, people form factions to pursue their common 
ends. Give them access to government power to further those ends, and 
they will take advantage of it. The only modest additions we make to 
these ancient truths are two propositions: First, as of the 1990s, the con- 
stitutional restraints on how a faction may use government to further 
its ends have loosened. Second, an unprecedented coalition of the smart 
and the nch will take advantage of this new latitude in new ways. 

FACING REALITY ABOUT THE UNDERCLASS 

What new ways? There are many possibilities, but the central ones all 
involve the underclass. We fear that a new kind of conservatism is be- 
coming the dominant ideology of the affluent—not in the social tradi- 
tion of an Edmund Burke or in the economic tradition of an Adam 
Smith but “conservatism” along Latin American lines, where ro he con- 
servative has often meant doing whatever is necessary to preserve the 
mansions on the hi Hs from the menace of the slums below. In the case 
of the United States, the threat comes from an underclass that has been 
with American society for some years but has been the subject of unre- 
alistic analysis and ineffectual, often counterproductive policy. The new 
coalition is already afraid of the underclass. In the next few decades, it 
is going to have a lot more to be afraid of. Now is the time to bring to¬ 


gether from many chapters throughout the book the implicationsof cog¬ 
nitive stratificarion for the underclass. 

The Face of Children 

Statistically, it is not good for children to be horn eirher to a single 
mother or a married couple of low cognitive ability. But the greatest 
Problems afflict children unlucky enough to be horn to and reared by 
unmarried mothers who are below average in intelligence—about 20 
percent of children currently being born. 191 They tend to do badly, 
socially and economically. They tend to have low cognitive ahility 
themselves. They suffer disproportionately from hehavioral problems. 
They will be disproportionately represented in prisons. They are less 
likely to marry than others and will themselves produce large propor- 
tions of the children born to single women of low intelligence. 

Attempts to compensate for cognitive disad van tage at birth have 
shown how extraordinarily hard it is to do. Many readers no doubt find 
the plight of children to be among the most compelling arguments for 
government activism, as we do. But inadequate nutrition, physical 
abuse, emotional neglect, lack of intellectual Stimulation, a chaotic 
home environment—all the things that worry us when we think about 
the welfare of children—are very difficult to improve from outside the 
home when the single mother is incompetent. Incompetent mothers are 
highly concentrated among the least intelligent, and their numbers are 
growing. In Chapter 15, we discussed differential fertility—a bloodless 
term—and suggested that the narion is experiencing dysgenic pres¬ 
sure—another bloodless term. In the metric of human suffering, in- 
creasing numbers of children are born into the conditions we most 
deplore and the conditions that government is most helpless to affect. 

What happens to the child of low intelligence who survives child - 
hood and reaches adulthood trying to do his best to be a productive Cit¬ 
izen? Out of the many problems we have just sketched, this is the one 
we choose to italicize: AU of the problems that these children experience will 
become worse rather than better as they grou> older , for the labor market they 
will amfront a few decades down the road is going to be much harderfor them 
to cope with than the labor market is now. There will still be jobs for low- 
skill labor, mostly with Service businesses and private households, but 
the natural wage for those jobs will be low. Attempts to increase their 
wage artificially (by raising the minimum wage, for example, or man- 
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dating job benefits) may backfire by making alternatives to human la- 
bor more affordable and, in many cases, by making the jobs disappear 
altogether. People in the bottom quartile of intelligence are becoming 
not just increasingly expendable in economic terms; they will sometime 
in the not-too-distant future become a net drag. In economic terms and 
barring a profound change in direction for our society, many people will 
be unable to perform that function so basic to human dignity: putting 
more into the world than they take out. 

Perhaps a revolution in teaching technology will drastically in- 
crease the productivity returns to education for people in the lowest 
quartile of intelligence, overturning our pessimistic forecast. Bui 
there are no harbingers of any such revolution as we write. And un- 
less such a revolution occurs, all the fine rhetoric about “investing 
in human Capital” to “make America competitive in the twenty-first 
Century” is not going to be able to overturn rhis reality: For many 
people, there is nothing they can learn that will repay the cost of the 
teaching. 

The Emerging White Underclass 

The dry tinder for the formation of an underclass Community is a large 
number of births to single women of low intelligence in a concentrated 
spatial area. Sometime in the next few decades it seems likely that 
American whites will reach the point of conflagration. The proportion 
of white illegitimste births (including Latinos) reached 22 percent in 
1991 HO] j|^ ere is not hing about being Caucasian that must slow down 
the process. Britain, where the white illegitimacy ratio, which was much 
lower than the American white ratio as recently as 1979, hit 32 percent 
in 1992 with no signs of slowing down. 

When 22 percent of all births are to single women, the proportion in 
low-income communities is perhaps twice that. ln the NLSY, 43 per- 
cent of all births to white women who were below the poverty line were 
illegitimate, compared to 7 percent for all white women anywhere above 
the poverty line. 1111 In the nation at large, we know from the 1992 
Census Bureau study of fertility that women with College degrees com 
tribute only 4 percent of white illegitimate babies, while women with a 
high school education or less contribute 82 percent. Women with fam- 
ily incomes of $75,000 or more contribute 1 percent of white illegiti- 
mate babies, while women with family incomes under $20,000 
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contribute 69 percent. 12 White illegitimacy is overwhelmingly a lower- 
dass phenomenon. 

In the past, whites have not had an “underclass” as such, because the 
whites who might qualify have been too scatrered among the working 
dass, lnstead, white communities in America had a few streets on the 
outskirts of town inhabited by the people who couldn’t seem to cope 
and skid rows of unattached white men in large cities, but these Scat¬ 
terings were seldom large enough to make up a neighborhood. An un¬ 
derclass needs a critical niass, and white America has not had one. But 
if the overall white illegitimacy ratio is 22 percent—probably some- 
where in the 40 percent ränge in low-income communities—and rising 
fast, the question arises: At what point is critical mass reached? How 
much illegitimacy can a Community tolerate? Nobody knows, but the 
historical fact is that the trendlines on black crime, dropout from the 
labor force, and illegitimacy all shifted sharply upward as the overall 
black illegitimacy ratio passed 25 percent and the rate in low-income 
black communities moved past 50 percent. 

We need not rely on the analogy with the black experience. White 
illegitimacy is also overwhelmingly a lower-cognitive-class phenome¬ 
non, as we detailed in Chapter 8. Three-quarters of all white illegiti¬ 
mate births are to women below average in IQ, and 45 percent are to 
women with IQs under 90. tn! These women are poorly equipped for the 
lahor market, often poorly equipped to be mothers, and there is no rea- 
son to think that the outcomes for their children will be any better than 
the outcomes have been for black children. Meanwhile, as never-mar- 
ried mothers grow in numbers, the dynamics of the public housing mar¬ 
ket (where they will probably continue to be welcome) and the private 
housing market (where they will not) will feester increasing concentra- 
tions of whites with high unemployment, high crime, high illegitimacy, 
and low cognitive ability, creating communities that look very much 
like the inner-city neighborhoods that people now tend to associate 
with minorities. 

The white cognitive elite is unlikely to greet this development sym- 
pathetically. On the contrary, much of white resentment and fear of the 
black underclass has been softened by the complicated mixture of white 
guilt and paternalism that has often led white elites to excuse behavior 
in blacks that they would not excuse in whites. This does not mean that 
white elites will abandon the white underclass, but it does suggest that 
the means of dealing with their needs are likely to be hrusque. 
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Spatial Concentraäon, Low Cognitwe Ability, and Underclass Behavior 

As the patience of whites for other whites wears thiti, the black inner 
city will simultaneously be getting worse rather rhan better. Various 
scholars, led by William Julius Wilson, have described the outmigration 
of the ablest blacks that has left the inner city without its former lead- 
ers and role mode 1s. 14 Given a me an black IQ of ahout 85 and the link 
hetween socioeconomic Status and IQ within ethnie populations, rhe 
implication is that the black inner city has a population with a mean 
IQ somewhere in the low 80s at best, with a correspondingly small tail 
in the above-average ränge. (1:>l 

What is the minimum level of cognitive resources necessary to sus- 
tain a community at any given level of social and economic complex- 
ity? For sustaining a village of a few hundred people in a premodern 
society, the minimum average level is probably quite modest. What is 
it for sustaining a modern community? The question is of enormous 
practical significance yet remains innocent of any empirical investiga- 
rion whatsoever. Perhaps the crucial feature is the average cognitive 
ability. Perhaps it is the size of the cadre of high-ability people. Perhaps 
it is the weight of the population at low end of the distrihution. No one 
knows. Whatever the details, a prima facie case exists that the cognitive 
resources in the Contemporary inner city have fallen below rhe mini¬ 
mum level. What looked like a rising tide of social problems a genera- 
tion ago has come to look more like a fundamental breakdown in social 
organization. 

One may look for signs that these communities are ahout to recover. 
The crack cocaine epidemic of the 1980s has ebbed, for example, ab 
though crack is cheaper than ever, as the savage effects of the drug he- 
came evident to younger brothers and sisters. Black grass-roots efforts 
to restore the family and combat crime have increased in recent years. 
But counterpoised against these forces working on behalf of regenera- 
tion within the inner city is a powerful force working against it: A large 
majority of the next generation of blacks in the inner city is grovving up 
without fathers and with limited cognitive ability. The numbers con¬ 
ti nue to increase. The outmigration of the able continues. 

While we can see how these trends might be reversed, which we de- 
scribe in the next and final chapter, let us consider the prospecr we face 
if they do not. This Frings us to the denouement of our prognosis. 
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THE COMING OF THE CUSTODIAL STATE 

When a society reaches a certain overall level of affluence, the haves 
hegin to feel sympathy toward, if not guilt ahout, the condition of the 
have-nots. Thus Jawns the welfare state—the attempt to raise the poor 
and the needy out of their plight. In what direction does the social web 
fare System evolve when a coalition of the cognitive elite and the af- 
fluent continues to accept the main tenets of the welfare state but are 
increasingly frightened of and hostile toward the recipients of help? 
When the coalition is prepared to spend money but has lost faith that 
remedial social programs work?The mosr likely consequence in our view 
is that rhe cognitive elite, with its commanding Position, will imple- 
ment an expanded welfare state for the underclass that also keeps it out 
from underfoot. Our label for this outcome is the custodial state. 
Should it come to pass, here is a scenario: 

Over the next decades, it will hecome hroadly accepted by the cog¬ 
nitive elite that the people we now refer to as the underclass are in that 
condition through no fault of their own but hecause of inherent short- 
comings about which little can be done. Politicians and intelleetuals 
alike will hecome much more open about the role of dysfunctional be¬ 
havior in the underclass, accepting that addiction, violence, unavaib 
ability for werk, child abusc, and family disorganization will keep most 
members of the underclass from fending for themselves. It will be agreed 
that the underclass cannot be trusted to use cash wisely. Therefore pol- 
icy will consist of grearer benefits, but these will be primarily in the form 
of Services rather than cash. Furthermore, the re will be new restrictions. 
Specifically, these consequences are plausible: 

Child care in the inner city will hecome primarily the respunsibility of the 
state . Infants will get better nutrition hecause they will be spending their 
days in day care Centers from infancy. Children will get balanced diets 
hecause they will be eating breakfast, lunch, and perhaps supper at 
school. Day care Centers and schools for elementary students will edge 
closer toward comprehensive care facilities, whose staff will try to pro- 
vide not only education and medical care but to train children in hy" 
giene, sexual socialization, socialization to the world of work, and other 
funcrions that the parenrs are deemed incapable of providing. 

The homeless will vanish. One of the safer predictions is that sometime 
in the near future, the cognitive elite will join the broad public senti- 
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ment in favor of reasserting control over public spaces. It will become 
easier to consign mentally incompetent adults to custodial care. Perhaps 
the clinically borderline cases that now constitute a high proportion of 
the homeless will be required to reside in shelters, more elaborately 
equipped and staffed than most homeless shelters are today. Police will 
be retumed their authority to roust people and enforce laws prohibiting 
disorderly conduct. 

Strict policing and custodial responses to crime will become more accept - 
able and widespread. This issue could play out in several ways. The crime 
rate in affluent suburbs may be low enough to keep the pressure for re- 
form low. But events in the early 1990s suggest that fear of crime is ris- 
ing, and support for strict law enforcement is increasing. 

One possibility is that a variety of old police practices—especially the 
stop-and-frisk—will quietly come back into use in new guises. New pris- 
ons will continue to be built, and the cells already available will be used 
more efficiently to incarcerate dangerous offenders (for example, by 
eliminating mandatory sentences for certain drug offenses and by in- 
carcerating less serious offenders in camps rather than prisons). Tech¬ 
nology will provide new options for segregating and containing 
criminals, as the electronic bracelets now being used to enforce house 
arrest (or perhaps “neighborhood arrest”) become more flexible and 
foolproof. Another possibility is that support will grow for a national 
System of identification cards, coded with personal Information includ- 
ing criminal record. The possibilities for police surveillance and control 
of behavior are expanding rapidly. Until recently, the cognitive elite has 
predominantly opposed the use of such technology. In a few years, we 
predict, it will not. 

The underclass will become even more concentrated spatially than it is to¬ 
day. The expanded network of day care Centers, homeless shelters, pub¬ 
lic housing, and other Services will always be located in the poorest part 
of the inner city, which means that anyone who wants access to them 
will have to live there. Political support for such measures as relocation 
of people from the inner city to the suburbs, never strong to begin with, 
will wither altogether. The gaping cultural gap between the habits of 
the underclass and the habits of the rest of society, far more impassable 
than a simple economic gap between poor and not poor or the racial 
gap of black and white, will make it increasingly difficult for children 
who have grown up in the inner city to function in the larger society 
even when they want to. 


The underclass will grow. Düring the 1980s, scholars found evidence 
that the size of the underclass was no longer expanding. 17 But even as 
they wrote, the welfare rolls, which had moved within a narrow ränge 
since the late 1970s, began to surge again. The government will try yet 
another round of the customary social programs—sex education, job 
training, parenting training, and the like—and they will be as ineffec- 
tual this round as they were in the 1960s and 1970s. 18 Meanwhile, many 
low-income parents who try to do all the right things and pass their val- 
ues on to their children will be increasingly unable to do so. They can- 
not propagate their norms in the face of a local culture in which 
illegitimacy, welfare, crime, and drugs are commonplace, and there is 
nothing magically invulnerable about them or their children. Some of 
the reforms we have described will be improvements—crime might ac- 
tually drop in the inner city as well as in the other parts of town, for ex¬ 
ample—but the main effect will be to make it harder for the children 
in these solid and conventional working-class families to emulate their 
parents. Marriage, steady employment, and responsible behavior of 
many kinds will fall among the next generation, and some portion of 
the working dass will become members of the underclass. Few children 
of those already in the underclass will escape. 

Social budgets and measures for social control will become still more cen - 
tralized. The growing numbers of illegitimate children born to poor 
women will have multiplier effects on social welfare budgets—directly 
and through increased indirect costs generated in the educational and 
law enforcement Systems. As States become overwhelmed, the current 
cost sharing between the States and federal government will shift toward 
the federal budget. The mounting costs will also generate intense po- 
litical pressure on Washington to do something. Unable to bring itself 
to do away with the welfare edifice—for by that time it will be assumed 
that social chaos will follow any radical cutback—the government will 
continue to try to engineer behavior through new programs and regu- 
lations. As time goes on and hostility toward the welfare-dependent in- 
creases, those policies are likely to become authoritarian and rely 
increasingly on custodial care. 

Racism will reemerge in a new and more virulent form . The tension be¬ 
tween what the white elite is supposed to think and what it is actually 
thinking about race will reach something close to a breaking point, This 
pessimistic prognosis must be contemplated: When the break comes, 
the result, as so often happens when cognitive dissonance is resolved, 
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will be an overreaction in the other direction. Insteiui ot the c.andor and 
realism abour race that is so urgently needcd, the nation will be faeed 
with racial divisiveness and hostility that Ls as great as, or greater, than 
America experienced hefore the civil rights movement. We realize how 
outlandish it seems to predict that educated and influential Americans, 
who have been so puritanical about racial conversation, will openly re- 
vert to racism. We would not go so far as to say it is probable, lt is, how- 
ever, more than just possible. If it were to happen, all the scenarios for 
the custodial state would be more unpleasant—more vicious—than 
anyone can now imagine. 

In short, by custodia! state, we have in mind a high-tech and more lav- 
ish Version of the Indian reservation for some substantial minority of 
the nation’s population, while the rest of America tries to go abour irs 
bus iness. ln its less benign for ms, the Solutions will hecome more and 
more totalitarian. Benign or otherwise, “going abour its business” in the 
old sense will not be possible. lt is difficult to imagine the United States 
preserving its heritage of individualism, equal rights hefore the law, free 
people running their own lives, once it is accepted that a significant part 
of the population must be made permanent wards of the state. 

Extrapolating from current trends, we project that the policies of cus- 
todialism will be not only tolerated but actively supported by a Consen¬ 
sus of the cognitive elite. To some extent, we are not even real ly 
projecting but reporting. The main difference between the pnsition of 
the cognitive elite that we portray here and the one that exists rnday is 
to some extent nothing more than the distinction between tacir and ex- 
plicit. 

If we wish to avoid this prospect for the future, we cannot count on 
rhe natural course of events to make things come out right. Now is the 
time to think hard about how a society in which a cognitive elite dorn- 
inares and in which helow-average cognitive ahility is increasingly a 
handicap can also he a society that makes good on the fundamental 
promise of the American tradition: the opportunity for everyone, not 
just the lucky ones, to live a satisfying life. That is the task to which we 
now rum. 


Chapter 22 

A Place for Everyone 


H ow should policy deal with the twin realities that people differ in 
intelligence for reasons that are not their fault, and that intelli- 
gence has a powcrtul bearing on how well people do in life! 

The answer of the twentieth Century has been that government 
should create the equality of condition that society has neglected to pro- 
duce on its own. The assumption that egalitarianism is the proper ideal, 
however difficult it may be to achieve in practice, suffuses contempo- 
rary political theory. Socialism, communism, social democracy, and 
Ameriea’s welfare state have heen different ways of moving toward the 
egalitarian ideal. The phrase social justice has hecome virtually a syn¬ 
onym for economic and social equality. 

Until now, these political movements have focused on the evils of 
Systems in produc ing inequality. Human heings are potential ly pretty 
much the same, the dominant political doctrine has argued, except for 
the inequalities produced by society. These same thinkers have gener- 
ally rejected, often vitriolically, arguments that individual differences 
such as intelligence are to blame. But there is no reason why they could 
not shift ground. ln many ways, the material in this book is tailor-made 
for their case. If it’s not someone’s fault that he is less intelligent than 
others, why should he be penalized in his income and social Status? 

We could respond with a defense of income differences. For exam- 
ple, it is justified to pay the high-IQ businessman and engineer more 
than the low-IQ ditch digger, producing income inequality, because 
that’s the only way to make the economy grow and produce more wealth 
in which the ditch digger can share. We could grant thar it is a matter 
not of just deserts but of economic pragmatism about how to produce 
compensating benefits for the least advantaged members of society. 1 
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Such arguments make sense to us, as far as they go. After the experienee 
of the fwentieth Century, it is hard to imagine that anyone still disagrees 
with them. But there are other issues, transcending the efficiency of an 
economy. Our central concern since we began writing this book is how 
people might live together harmoniously despite fundamental individ¬ 
ual differences. The answer lies outside economics. 

The initial purpose of this chapter is to present for your considera- 
tion another way of thinking about equality and inequality. It represents 
an older intellectual tradition than social democracy or even socialism. 
ln our view, it is also a wiser tradition, more attuned to the way in which 
individuals go about living satisfying lives and to the ways in which so- 
cieties thrive. The more specific policy conclusions to which we then 
turn cannot he explained apart from this underpinning. 

THINKING ABOUT EQUALITY AS AN IDEAL 

For thousands of years, great political thinkers of East and Wesr tried to 
harmonize human differences. For Confucius, society was like bis coiv 
ception of a family—extensions of a ruling father and ohediem sons, de- 
voted husbands and faithful wives, benign masters and loyal servants. 
People were defined hy rheir place, vvhether in the family or rhe Com¬ 
munity. So too for the ancient (jreek and Roman philosophers: place 
was all. All the great religious traditions define a place for everyone, if 
not on earth then in heaven. 

Society was to be ruled hy the virruous and wise few. The everyday 
business of the Community feil to the less worrhy multitude, with the 
most menial chores left to the slaves. Neither the Greek democrats nor 
the Roman rcpublieans helieved that “all men are created equal.” Nor 
did the great Hindu thinkers of the Asian subcontinent, where one’s 
work defined onescaste, which in turn circumscribed every other aspect 
of life. The ancients accepted the hasic premise that people differ funda- 
mentally and importantly and searched for ways in which people could 
contentedly serve the community (or the monarch or the tyrant or rhe 
gods), rather than themselves, despite their differences. Philosophers ar- 
gued about obligations and duties, what they are and on whom they fall. 

ln our historical era, political philosophers have argued instead ahout 
rights. They do so because they are trying to solve a different prohlem. 
The great Transformation from a search for duties and obligations to a 
search for rights may he dated with Thomas Hobbes, writing in the mid- 


1600s about a principle whereby all people, not just the rieh and well 
born, might have equal rights to liberry. 2 Everyone, said Hobbes, is enti- 
tled to as much liberty in gratifying bis desires as he is willing to allow 
others in gratifying theirs.^ 1 People differ, acknowledged Hobbes, but 
they do not differ so much that they may justifiably be deprived of lib¬ 
erty by differing amounts, In the modern view that Hobbes helped shape, 
individuals freely accept constraints on their own behavior in exchange 
for ridding themselves of the dangers of living in perfect freedom, hence 
perfect anarchy.^' The constraints constitute lawful government. 

Hobbes helieved that the only alternatives for human society are, in 
effect, anarchy or absolute monarchy. Given those alternatives, said 
Hobbes, a rational person would choose a monarch to ensure the equal- 
ity of political rights, rather than take his chances with perfect freedom. 
His successor in English political thought, John Locke, did not accept 
the Hobhesian choice between despotism and anarchy. He conceived 
of people in a state of nature as being in u a State also of Equality , wherein 
all the Power and Jurisdiction is reciprocal, no one having more than 
another,and sought to preserve that condition in actual societies 
through a strictly limited government. What Locke propounded is es- 
pecially pertinent here because it was his theory that rhe American 
Founders brought into reality. 

But with Locke also arose a confusion, which has grown steadily with 
passing time. For most Contemporary Americans who are aware of Locke 
at all, he is identified with the idea of man as tabula rasa, a blank slat.e 
on which experienee wrires. Without experienee, Locke is often be- 
lieved to have said, individuals are both equal and empty, a blank slate 
to be written upon by the environment. Many Contemporary libertari- 
ans who draw their Inspiration from Locke are hostile to the possibility 
of genetic differences in intelligence because of their conviction that 
equal rights apply only if in fact people at birrh are tabulae rasae. With 
that in mind, consider these remarks about human intelligence from 
Locke’s An Essay on Human Undersuinding: 

Now that there is such a difference between men in respect of their 
understandings, 1 think nobody who has had any conversation with 
his neighbors will question. . . . Which great difference in men’s in- 
tellecruals, whether it rises from any defect in the organs of the body 
particularly adapted to thinking, or in the dullness or untracrableness 
of those faculties for want of use, or, as some think, in the natural dif- 




530 Living Togecher 


A Place for Everynne 5.31 


ferences of mens souls themselves; or seine or all of these together, 
ir matters not here to examine. Only this is evident, that there is a 
difference of degrees in men’s understandings, apprehensions, and 
reasonings, to so great a latitude that one may, without doing injury 
to mankind, affirm that there is a greater distance between some men 
and others in this respect, than between some men and some beasts/’ 

Locke is strikingly indifferent to the source of cognitive differences 
and strikingly harsh in his judgment about their size. But that does not 
mean he believed people to have different rights. They are equal in 
rights, Locke proclaimed, though they be unequal in everything eise. 
Those rights, however, are negative rights (to impose Contemporary ter- 
minology): They give all human heings the right not to have certain 
things done to them hy the state or by other human heings, not the right 
to anything, except freedom of action. 

This way of putting it is out of tune with the modern sensibility. The 
original coneept of equal rights is said to be meaningless canr, out- 
moded, taking equal iights seriously, it is thought, requires enforcing 
equal outcomes. The prevailing political atritude is so dismissive toward 
the older conception of equal rights that it is difficult to think of seri- 
ous public rreatments of it; the Founders just didn’t think hard enough 
about that problem, it seems to be assumed. If he were alive today, some 
eminent political scientists have argued, Thomas Jefferson would surely 
be a social democrat or at least a New Deal Democrat. 7 We are asking 
that you consider the alternative: that the Founders were fully aware of 
how unequal people are, that they did not try to explain away natural 
inequalities, and that they nonetheless thought the best way for people 
to live together was under a System of equal rights. 

The Founders wrote frankly about the inequality of men. For Thomas 
Jefferson, it was obvious that they were especially unequal in virtue and 
intelligence. He was thankful for a “natural aristocracy’’ that could 
counterbalance the deficiencies of the others, an “aristocracy of virtue 
and talent, which Nature has wisely provided for the direction of the 
interests of society.” 8 It was, he once wrote, “the most precious gift of 
nature, and he thought that the best government was one that most 
efficiently brought the natural aristocracy to high positions. 9 

Jefferson saw the consequences of inequalities of ability radiating 
throughout the institutions of.society. The main purpose of education, 
he believed, was to prepare the natural aristocracy to govern, and he did 


not mince words. The “best geniuses” should be “raked from the rub- 
bish annually” by coinpetitive grading and examinations, sent on to the 
next educational stage, and finaliy cailed to public lifed I0] But if the au- 
thor of the Declaration of Independence was by today’s Standards unre- 
pentantly elitist, he was nonetheless a democrat in his belief that the 
natural aristocracy was “scattered with equal hand through all [of soci- 
ety’s] conditions,” 11 and in his confidence that the electorate had the 
good sense to choose them. “Leave to the citizens the free election and 
Separation of the aristoi from the pseudo -aristoi” he advised. “ln general, 
they will elect the real good and wise.” 12 For Madison, the “great re- 
publican prineiple” was that the common people would have the pub- 
lic-spiritedness and the Information necessary to choose “men of virtue 
and wisdom” to govern them. 11 For both Jefferson and Madison, politi¬ 
cal equality was both right and workahle. They would have been amazed 
hy the notion that humans are equal in any other sense, 

Nor were Jeffersons and Madisons views a reflection of their South¬ 
ern heritage. John Adams, that quintessential Yankee, agreed that “nat¬ 
ural aristocracy is a fact essential to be considered in the institution of 
government”—or, as he put it in another instance, “I believe there is as 
much in the hreed of men as there is in that of horses.” M He was not as 
optimistic as Jefferson and Madison, for he was keenly aware that in¬ 
telligence does not necessarily go with virtue, and he was fearful that 
Jefferson’s natural aristocracy would within a few generations have ce- 
mented its descendants’ positions into that of a ruling caste. But he did 
not doubt that the reality of human inequalities was of central political 
importance. 11 ^ 1 

The other Founders, including Hamilton and Washington, rumi- 
nated in the same vein about the inequality of men and the political 
implications of that inequality. In doing so, they were following an an¬ 
dern tradition. Political philosophers have always begun from the un- 
derstanding that good policy must be in accordance with what is good 
for human heings, and that what is good for humans must be based on 
an understanding of how they are similar and how they differ. Aristotle 
put it earliest and perhaps best: “All men believe that justice means 
equality in some sense. . . . The question we must keep in mind is, equal¬ 
ity or inequality in what sort of thing.” 16 

The Founders saw that making a stähle and just government was dif¬ 
ficult precisely because men were unequal in every respect except their 
rieht to advance their own interests. Men had “different and unequal 
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faculties of acquiring property,” Madison reflected in The FederalistT 
This diversity was the very reason why rights of property were so im¬ 
portant and why “the protection of those faculties is the first object of 
Government*” But the diversity was also the defect of populist democ- 
racy, because the unequal distribution of property to which it led was 
“the most common and durable source offactions.” And faction, he ar- 
gued, was the great danger that the Constitution sought above all to 
confine and tarne. The task of government was to set unequal persons 
into a system of laws and procedures that would, as nearly as possible, 
equalize their rights while allowing their differences to express them- 
selves. The result would not necessarily be serene or quiet, but it would 
be just* It might even work. 

In reminding you of these views of the men who founded America, 
we are not appealing to their historical eminence, but to their wisdom* 
We think they were right. Let us stop using words like factions and fac¬ 
ulties and aristoi and state in our own words, briefly and explicitly, how 
and why we think they were right in ways that apply today. 

The egalitarian ideal of Contemporary political theory underestimates 
the importance of the differences that separate human beings* It fails to 
come to grips with human Variation* It overestimates the ability of po¬ 
litical interventions to shape human character and capacities. The Sys¬ 
tems of government that are necessary to carry out the egalitarian agenda 
ignore the forces that the Founders described in The Feder allst, which 
lead inherently and inevitably to tyranny, throughout history and across 
cultures* These defects in the egalitarian tradition are reflected in polit¬ 
ical experience, where the failure of the communist bloc to construct 
happy societies is palpably apparent and the ultimate fate of even the 
more benign egalitarian model in Scandinavia is coming into question* 

The perversions of the egalitarian ideal that began with the French 
Revolution and have been so plentiful in the twentieth Century are not 
accidents of history or produced by technical errors in implementation. 
Something more inevitable is at work. People who are free to behave 
differently from one another in the important affairs of daily life in¬ 
evitably generate the social and economic inequalities that egalitarian- 
ism seeks to suppress* That, we believe, is as close to an immutable law 
as the uncertainties of sociology permit. To reduce inequality of condi¬ 
tion, the state must impose greater and greater uniformity. Perhaps that 
is as close to an immutable law as political Science permits. In T* H. 
Whites Version of the Arthurian legend, The Once and Future King , 


Merlyn transforms young Arthur into an ant as part of his education in 
governance. In this guise, Arthur approaches the entrance to the ant 
colony, where over the entrance are written the words, EVERYTHING NOT 
FORBIDDEN IS COMPULSORY * 18 Such, in our view, is where the logic of the 
egalitarian ideal ultimately leads* It is appropriate in the ant colony or 
the beehive but not for human beings. Egalitarian tyrannies, whether 
of the Jacobite or the Leninist variety, are worse than inhumane. They 
are inhuman. 

The same atmosphere prevails on a smaller scale where ver “equality” 
comes to serve as the basis for a diffuse moral outlook. Consider the 
many small tyrannies in America’s Contemporary universities, where it 
has become objectionable to say that some people are superior to other 
people in any way that is relevant to life in society. Nor is this outlook 
confined to judgments about people. In art, literature, ethics, and cul- 
tural norms, differences are not to be judged. Such relativism has be¬ 
come the moral high ground for many modern commentators on life 
and culture. 

Even the existence of differences must be discussed gingerly, when 
they are human differences. As soon as the differences are associated 
with membership in a group, censorship arises. In this book, we have 
trod on one ofthose most sensitive areas by talking about ethnic differ¬ 
ences, but there are many others. In what respects do men differ from 
women? Young differ from old? Heterosexuals from homosexuals? The 
permissible answers, often even the permissible questions, are sharply 
circumscribed. The moral outlook that has become associated with 
equality has spawned a vocabulary of its own. Discrimination, once a 
useful word with a praiseworthy meaning, is now almost always used in 
a pejorative sense. Racism, sexism, ageism, elitism—all are in common 
parlance, and their meanings continue to spread, blotting up more and 
more semantic territory. 

The ideology of equality has done some good. For example, it is not 
possible as a practical matter to be an identifiable racist or sexist and 
still hold public office. But most of its effects are bad. Given the power 
of Contemporary news media to imprint a nationwide image overnight, 
mainstream political figures have found that their allegiance to the 
rhetoric of equality must extend very far indeed, for a single careless re- 
mark can irretrievably damage or even end a public career. In everyday 
life, the ideology of equality censors and straitjackets everything from 
pedagogy to humor. The ideology of equality has stunted the ränge of 
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moral dialogue co criviaiity. In daily life—conversations, the lessons 
raught in public schools, the kinds of screenplays or newspaper feature 
stories that people choose to write—the moral ascendancy of equality 
has made it difficult to use concepts such as virtue, excellence, beauty 
and—above all—truth. 

Within the realm of govemment, small versions of the “everything 
not forbidden is compulsory” mentality may be seen everywhere. The 
informal old American prineiple governing personal behavior was that 
you could do whatever you wanted as long you didn’t force anyone eise 
to go along with you and as long as you let the other fei low go about his 
affairs with equal freedom. The stopping point was defined by the use- 
ful adage, u Your freedom to swing your arm stops where my nose he- 
gins.” w ln laws great and small, this prineiple has been perverted beyond 
recognition, as the notions of what constirutes “where my nose begins” 
Stretch lar out into space. The practice of affirmative action has been a 
classic example of the “everything not forbidden is compulsory” men¬ 
tality, as the idea of forbidding people to discriminare by race mutated 
into the idea of compelling everyone to help produce equal outcotnes 
by race. In rort law, the destruction of the concept of negligence grew 
out of an explicitly egalitarian view of the purpose of liability—not to 
redress individual victims für acts of irresponsibility but to redistribute 
goods more equitably. 20 ln personal life, the idea of forbidding people 
from interfering with members of other groups (blacks, homosexuals, 
women) as they went about their lives has been extended to the idea of 
compelling people to“trear them the same.” It is a markof how far things 
have gone that many people no longer can see the distinction between 
“not: interfering” and “trearing the same.” 

Our views on all of these issues are decidedly traditional. We think 
that rights are embedded in our freedom to act, not in the obligarions 
we may impose on others to act; that equality of rights is crucial while 
equality of outcome is not; that concepts such as virtue, excellence, 
beauty, and truth should be reintroduced into moral discourse. We are 
comfortahle with the idea that some things are better than others—not 
just according to our subjective point of view but according to endur- 
ing Standards of merit and inferiority—and at the same time reject the 
thought that we (or anyone eise) should have the right to impose those 
Standards. We are enthusiastic about diversity—the rieh, unending di- 
versity that free human beings generate as a matter of course, not the 
imposed diversity of group quotas. 


And so we come to this final chapter, discussing the broadest policy 
implications of all that has gone before. We bring to our recommenda- 
tions a predisposition, believirig that the original American conceptions 
of human equality and the pursuit of happiness still off er the wisest guid- 
ance for thinking about how to run today s America. These have been 
some of our reasons why. 

LETTING PEOPLE FIND VALUED PLACES IN SOCIETY 

Wirb these thoughts on the table, let us return to the question that 
opened the drapier: Hure should policy deal with the twin realities that peo¬ 
ple differ in inte Ui ge nee for reasons that are not their fault and that Intelligence 
has a powerful bearing on how well people do in life? The answer turns us 
back to the ancient concern with place. 

The Goal and a Definition 

The broadest goal is a society in which people throughout the functional 
ränge of intelligence can find, and feel they have tound, a valued place 
for themselves. For “valued place,” we offer a pragmatic definition: You 
occupy a valued place if other people would miss you ij you were gone. The 
fact that you would be vnissed means that you were valued. Both the 
quality and quantity of valued places are important. Most people hope 
to find a soulmate for life, and that means someone who would “miss 
you” in the widest and most intense way. The definition captures the 
reason why children are so important in defining a valued place. But be- 
sides the quality of the valuing, quantity too ls important. It a single per- 
son would miss you and no one eise, you have a fragile hold on your 
place in society, no matter how much that one person cares for you. To 
have many different people who would miss you, in many different parts 
of your life and at many levels of intensity, is a hallmark of a person 
whose place is well and thoroughly valued. One way of thinking about 
policy options is to ask whether they aid or obstruct this goal of creat- 
ing valued places. 

Fmding Valued Places 

The great hulk of the American population is amply equipped, in their 
cognitive resources and in other personal characteristics, to find valued 
places in society. We must emphasize that, because forhundreds of pages 
we have focused on people at the two tails of the bell curve. Now is a 
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good time to recall the people in the broad part of the curve, between 
the extremes* In figure after figure throughout Chapter 16, the pattern 
was consistent: The prevalence of the social maladies we reviewed was 
strikingly concentrated in the bottom IQ deciles* By the time people 
were even approaching average IQ, the percentages of people who were 
poor, had babies out of wedlock, provided poor environments for their 
children, or exhibited any other problem constituted small percentages 
of the population* Translated into the themes we are about to introduce, 
the evidence throughout this book Supports the proposition that most 
people by far have enough Intelligence for getting on with the business 
of life* We believe the policies we advocate will benefit them as well, 
by creating a generally richer and more vital society, but it should be 
made explicit: Our Solutions assume that the average American is an 
asset, not part of the problem. 

Finding Valued Places IfYou Arent Very Smart: The Traditional 
Context 

Nonetheless, millions of Americans have levels of cognitive ability low 
enough to make their lives statistically much more difficult than life is 
for most other people. How may policy help or obstruct them as they go 
about their lives? Our thesis is that it used to be easier for people who 
are low in ability to find a valued place than it is now* 

In a simpler America, being comparatively low in the qualities mea- 
sured by IQ did not necessarily affect the ability to find a valued niche 
in society. Many such people worked on fartns. When farms were small, 
technology was limited to the horse-drawn plow and a few hand tools, 
and the same subsistence crops were grown year after year. People who 
would score 80 or 90 on an IQ test could be competent farmworkers, 
not conspicuously distinguished from most other people in wealth, 
home, neighborhood, or Status in the community. Much the same could 
be said of a wide variety of skilled and unskilled trades. Even an un- 
skilled laborer who was noticeably lower on the economic scale was part 
of a community in which many others with many levels of ability lived 
close to him, literally and socially. Inevitably, with technological ad- 
vances, the niches for the less intelligent have shrunk. 

As for the most intimate affiliations—marriage and children—there 
formerly was little difference between people of varying abilities: To be 
married meant to be responsible for each other, and for the children of 


that marriage, in unqualified and uncompromising ways that the entire 
community held to be of the highest importance. Those who met those 
responsibilities had a valued place in the community by definition. 
Those who failed conspicuously in those responsibilities were outcasts 
by definition. Meeting the responsibilities of marriage and parenthood 
did not take a lot of money and did not take high intelligence. The com¬ 
munity provided clear and understandable incentives for doing what 
needed to be done. 

Urban communities were somewhat different from small towns in 
these respects but not unrecognizably so. The top socioeconomic layer 
moved off to its own part of town, but this left a broad ränge of people 
living together in the rest of a city’s neighborhoods, and the social func- 
tioning of those neighborhoods shared many characteristics with small 
towns. The responsibilities of marriage and children were as clearly de- 
fined in urban neighborhoods as in rural ones, and success and failure 
in those responsibilities were as visibly rewarded and punished. 

As for the other ways in which people found valued places for them- 
selves, urban neighborhoods teemed with useful things to do. Anyone 
who wanted to have a place in the community could find one in the lo¬ 
cal school boards, churches, Union halls, garden clubs, and benevolent 
associations of one sort or another. The city government provided the 
police who walked the local beat. It ran the courthouse and public hos- 
pital downtown, and perhaps an orphanage and a home for the aged, 
but otherwise the neighborhood had to do for itself just about every- 
thing that needed doing to keep the social contract operative and daily 
life on an even keel. Someone who was mentally a bit dull might not 
be chosen to head up the parish clothing drive but was certainly eligi- 
ble to help out. And these were just the organized aspects of commu¬ 
nity life. The unorganized web of interactions was even more extensive 
and provided still more ways in which people of all abilities, including 
those without much intelligence, could fit in. 

It is not necessary to idealize old-fashioned neighborhoods or old- 
fashioned families to accept the description we have just given. All sorts 
of human problems, from wretched marriages to neighborhood feuds 
and human misery of every other sort, could be found. Poverty was ram- 
pant (recall from Chapter 5 that more than half of the population prior 
to War II was in poverty by today’s definition). Even so, when the re¬ 
sponsibilities of marriage and parenthood were clear and uncompro¬ 
mising and when the stuff of community life had to be carried out by 
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the neighborhood or it wouldn’t get done, society was full of accessible 
valued places for people of a broad ränge of abilities, 

Finding Valued Places If You ArerCt Very Smart: The Contemporary 
Context 

Out of the myriad things that have changed since the beginning of the 
Century, two overlapping phenomena have most affected people with 
modest abilities: It has become harder to make a living to support the 
valued roles of spouse, parent, and neighbor, and functions have been 
stripped from one main source of valued place, the neighborhood* 

The ECONOMIC ARGUMENT. The cognitive elite has pulled away from 
the rest of the population economically, becoming more prosperous 
even as real wages in the rest of the economy stagnated or feil* The di- 
vergence has been most conspicuous in the lowest-skilled jobs. From 
their high point in 1973, the median earnings of full-time workers in 
general nonfarm labor had fallen by 36 percent by 1990, far more than 
for any other category* [21] A strong back isn’t worth what it used to be. 
Workers in those occupations have been demoralized. T ey have lost 
their valued place in the workplace* 

So far, we agree that economics plays an important role in taking val- 
ued places in the workplace from those with low cognitive ability. But 
the argument typically widens, asserting that economic change also ex- 
plains why people in low-skill occupations experience the loss of other 
valued places evidenced by failing marriage rates and rising illegitimacy: 
Men in low-skill jobs no longer make enough money to support a fam- 
ily, it is said. This common argument is too simplistic. In constant dol- 
lars, the income of a full-time, year-round male worker in general 
nonfarm labor in 1991 was at the level of his counterpart in 1958, when 
the norm was still one income per family, marriage rates were as high as 
ever, and illegitimacy was a fraction of its current levels* We may look 
back still further: The low-skill laborer in 1991 made about twice the 
real income of his counterpart in 1920, a year when no one thought to 
question whether a laborer could support a family, [22] Economics is rel¬ 
evant in understanding how it has become harder for people of modest 
abilities to find a valued place, and Solutions should take economics into 
account* But economics is not decisive. 

Stripping functions from the neighborhood, Communities are rieh 
and vital places to the extent that they engage their members in the 


stuff of life—birth, death, raising children, making a living, helping 
friends, singing in the local choir or playing on the softball team, cop 
ing with problems, setting examples, welcoming, chastising, celebrat 
ing, reconciling, and negotiating. 

If there is one theme on which observers from both left and right re- 
cently sound very much alike, it is that something vital and important 
has drained out of American communities . 24 Most adults need some- 
thing to do with their lives other than going to work, and that some- 
thing consists of being stitched into a fabric of family and community. 
In the preceding chapter, we alluded to the federal domination of pub¬ 
lic policy that has augmented the cognitive elite’s political leverage dur- 
ing the last thirty years. The same process has had the collateral effect 
of Stripping the neighborhood of much of the stuff of life. For what 
seemed like sufficient reasons at the time, Congress and presidents have 
deemed it necessary to remove more and more functions from the neigh- 
borhood. The entire social welfare System, Services and cash payments 
alike, may be viewed in that light. Certain tasks—such as caring for the 
poor, for example—were deemed to be too difficult or too poorly per- 
formed by the spontaneous efforts of neighborhoods and voluntary or- 
ganizations, and hence were transferred. The States have joined in this 
process. Whether federal and state policymakers were right to think that 
neighborhoods had failed and that the centralized govemment has done 
better is still a subject of debate, as is the net effect of the transfers, but 
the transfers did indeed occur and they stripped neighborhoods of tra- 
ditional functions . 25 

The cognitive elite may not detect the declining vitality in the lo¬ 
cal community. For many of them, the house is important- its size, lo- 
cation, view, grounds. They may want the right kind of address and the 
right kind of neighbors. But their lives are centered outside a geo- 
graphic community; their Professional associates and friends may be 
scattered over miles of suburbs, or for that matter across the nation and 
the world. For large segments of American society, however, the geo- 
graphic neighborhood is the major potential resource for infusing life 
with much of its meaning. Even the cognitive elite needs local com¬ 
munities, if not for itself, then for those of its children who happen not 
to land at the top of the cognitive ability distribution. The massive 
transfer of functions from the locality to the government has stripped 
neighborhoods of their traditional shared tasks. Instead, we have neigh¬ 
borhoods that are merely localities, not communities of people tend- 
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ing to their communal affairs. Valued places in a neighborhood are 
created oniy to the extent that the people in a neighborhood have 
valued tasks to do. 

People who have never lived in such a neighborhood—and as time 
goes on this includes more and more of the cognitive elite and the 
affluent in general—often find this hard to believe. It is another case 
of the isolation we discussed in Chapter 21: They may read about such 
communities in books, but surely they no longer exist in real life. But 
they do. Thumb through a few weeks’ issues of the newspaper from 
any small town, and you will find an America that is still replete with 
fund-raising suppers for the local child who has cancer, drives to 
collect food and clothing for a family that has suffered a reverse, and 
even barn raisings. They may exist as well (though they are less well 
documented) in urban working-class neighborhoods that have man- 
aged to retain their identity. It is through such activities that much of 
the real good for the disadvantaged is accomplished. Beyond that, they 
have a crucial role, so hard to see from a Washington office, of creat- 
ing ways for people of a wide level of incomes and abilities to play a 
part. It creates ways for them to be known—not just as a name but as 
a helpful fei low, a useful person to know, the woman you can always 
count on. It creates ways in which you would be missed if you were 
gone. 

Thus arises our first general policy prescription: A wide ränge of social 
functions should be restored to the neighborhood when possible and otherunse 
to the municipality . The reason for doing so, in the context of this hook, 
is not to save money, not even because such Services will be provided 
more humanely and efficiently by neighborhoods (though we believe 
that generally to be the case), but because this is one of the best ways 
to multiply the valued places that people can fill. As the chapter con- 
tinues, we will offer some other possibilities for accomplishing this and 
collateral objectives. But before arguing about how it is to be done, we 
hope that there can be wide agreement on the importance of the goal: 
In a decent postindustrial society, neighborhoods shall not have lost 
their importance as a source of human satisfactions and as a generator 
of valued places that all sorts of people can fill. Government policy can 
do much to foster the vitality of neighborhoods by trying to do less for 
them. 


SIMPL1FYING RULES 

The thesis of this section may be summarized quickly: As of the end of 
the twentieth Century, the United States is run by rules that are Gonge* 
nial to people with high IQs and that make life more difficult for every¬ 
one eise. This is true in the areas of criminal justice, marriage and 
divorce, welfare and tax policy, and Business law, among others. It is true 
of rules that have been intended to help ordinary people—rules that 
govem schooling, medical practice, the labeling of goods, to pick some 
examples. It has happened not because the cognitive elite consciously 
usurped the writing of the rules but because of the cognitive stratifica- 
tion described throughout the Book. The trend has affected not just 
rhose at the low end of the cognitive distribution but just about every- 
body who is not part of the cognitive and economic elites. 

The Systems have been created, Bit by bit, over decades, by people 
who think that complicated, sophisticated operationalizations of fair* 
ness, justice, and right and wrong are ethically superior to simple, black- 
and-white versions. The cognitive elite may not be satisfied with these 
Systems as they stand at any given point, but however they may reform 
them, the Systems are sure to become more complex. Additionally, conv 
plex Systems are precisely the ones that give the cognitive elite the great* 
est competitive advantage. Deciphering complexity is one of the things 
that cognitive ability is most directly good for. 

We have in mind two ways in which the mies generated by the cog¬ 
nitive elite are making life more difficult for everyone eise. Each requires 
somewhat more detailed explanation. 

Makmg h Easier to Make a Living 

First come all the rules that make life more difficult for people who are 
trying to navigate everyday life. In looking for examples, the 1040 in* 
come tax form is such an easy target that it need only be mentioned to 
make the point. But the same complications and confusions apply to a 
single woman with children seeking govemment assistance or a person 
who is trying to open a dry-cleaning shop. As the cognitive elite busily 
goes about making the world a better place, it is not so important to 
them that they are complicating ordinary lives. Its not so complicated 
to them. 
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The same bürden of complications that are only a nuisance ro peo- 
ple who are smart are much more of a barrier ro people who are not. In 
many cases, such barriers effectively block off avenues for people who 
are not cognitively equipped to struggle through the bureaucracy. In 
other cases, they reduce the margin of success so much that they make 
the difference between success and failure. “Sweat equity,” though the 
phrase itself has been recently coined, is as distinctively an American 
concept as “equality before the law” and “liberty.” You could get ahead 
by plain hard work. No one would stand in your way. Today that is no 
longer true. American society has erected barriers to individual sweat 
equity, by saying, in effect, “Only people who are good at navigating 
complex rules need apply.” Anyone who has tried ro open or run a small 
business in recent years can supply evidencc of how tormidable those 
barriers have become. 

Credentialism is a closely related prohlem. It goes all the way up the 
cognitive ränge—the Ph.D. is often referred to as “the union card" by 
graduate students who want to become College professors—hur ir is es- 
pecially irksome and obstructive for occupations further down the lad- 
der. Increasingly, occupations must be licensed, whether the service 
involves barbering or taking care of neighborhood children. The the- 
ory is persuasive—do you want someone taking care of your child who 
is not qualified?—but the practice typically means jumping rhrough bu- 
reaucratic hoops that have little to do with one s ability to do the job. 
The rise of licensing is both a Symptom and a cause of diminishing per¬ 
sonal ties, along with the mutual trust that goes with those ties. The li' 
censing may have some small capacity to filter out the least competent, 
but the benefits are often outweighed by the costs of the increased hm 
reaucratization. 

Enough examples. American society is rife with them. In many ways, 
life is more complicated than it used to be, and rhere’s nothing to be 
done about it. But as the cognitive elite has come to power, it has trailed 
in its wake a derritus of complexities as well, individually minor, that 
together have reshaped society so that the average person has a much 
tougher time running his own life. Our policy recommendation is to 
stop it and Strip away the nonsense. Consider the costs of complexity 
itself. Return to the assumption that in America the government has 
no business getting in people’s way except for the most compelling rea- 
sons, with “compelling” required to meet a stiff definition. 


Making It Basier to Live a Virtuous Life 

We Start with the supposition that almost everyone is capable of being 
a morally autonomous human being most of the time and given suitable 
circumstances. Political seientist James Q. Wilson has put this case elo- 
quenrly in The Moral Sense, calling on a wide ränge of social Science 
findings to support an old but lately unfashionable truth: Human beings 
in general are capable of deciding between right and wrong. 26 This does 
not mean, however, that everyone is capable of deciding between right 
and wrong with the same sophistication and nuances. The difference 
between people of low cognitive ability and the rest of society may be 
put in terms of a metaphor: Everyone has a moral compass, but some of 
those compasses are more suseeptible to magnetic storms than others. 
First, consider crime, then marriage. 

CRIMF. Imagine living in a society where the rules about crime are sim- 
ple and the consequences are equally simple. “Crime” consists of a few 
obviously wrong acts: assault, rape, murder, robbery, theft, trespass, de- 
struction of another’s property, fraud. Someone who commits a crime is 
probably caught—and almost certainly punished. The punishment al- 
most certainly hurts (it is meaningful). Punishment follows arrest 
quickly, within a matter of days or weeks. The members of the society 
subscribe to the underlying Codes of conduct with enthusiasm and near 
unanimity. They teach and enforce them whenever appropriate. Living 
in such a worid, the moral compass shows simple, easily understood di- 
rections. North is north, south is south, right is right, wrong is wrong. 

Now imagine that all the rules are made more complicated. The num¬ 
her of acts defined as crimes has multiplied, so that many things that are 
crimes are not nearly as obviously “wrong” as something like robbery or 
assault. The link between moral transgression and committing crime is 
made harder to understand. Fe wer crimes lead to an arrest. Fe wer arrests 
lead to prosecution. Many times, the prosecutions are not for something 
the accused person did but for an offense that the defense lawyer and 
the prosecutor agreed upon. Many times, people who are prosecuted are 
let off, though everyone (including the accused) acknowledges that the 
person was guilty. When people are convicted, the consequences have 
no apparent connection to how much harm they have done. These 
events are typically spread out over months and some times years. To top 
it all off, even the “wrongness” of the basic crimes is called into ques- 
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rion. ln the society at large (and translated onto the television and 
movie screens), it is commonly argued that robbery, for example, is not 
always wrong if it is in a good cause (stealing medicine to save a dying 
wife) or if it is in response to some external condition (exploitation, 
racism, etc.). At every level, it. becomes fashionable to point out the 
complexities of moral decisions, and all the ways in which things that 
might seem “wrong'’ at first glance are really “right“ when properly an- 
alyzed. 

The two worlds we have described are not far removed from the con- 
trast between the criminal justice System in the United States as re- 
cently as the 1950s and that System as of the 1990s. We are arguing that 
a person with comparatively low intelligence, whose time horizon is 
short and ability to balance many competing and complex incentives is 
low, has much more difficulty following a moral compass in rhe 1990s 
than he would have in the 1950s. Put aside your feelings about whether 
these changes in the criminal justice System represent progress. Simply 
consider them as a magnetic storm—as a set of changes that make the 
needle poinring to right and wrong waver erratically if you happen to 
be looking at the criminal justice sysrem from the perspective of a per¬ 
son who is not especially bright. People of limited intelligence can lead 
moral lives in a society that is run on the basis oPThou shalt not steal.” 
They find it much harder ro lead moral lives in a society that is run on 
the basis of “Thou shalt nor steal unless there is a really good reason 
ro.“ 1271 

The policy prescription is that the criminal justice sysrem should be 
made simpler . The meaning of criminal offenses used to he clear and ob- 
jective, and so were the consequences. It is worth trying to make them 
so again. 

MaRRIAüe. It has become much more difficult for a person of low cog- 
nitive ability to figure out why marriage is a good thing, and, once in a 
marriage, more difficult to figure out why one should stick with it 
through had times. The magnetic storm has swept through from many 
directions. 

The sexual revolution is rhe most obvious culprit. The old bargain 
from che man s point of view—get married, because that’s the only way 
you’re going to be able to sleep with the lady—was the kind of incen- 
tive that did nor require a lot of intellect to process and had an alb 


powerful effect on behavior. Restoring it is not feasible hy any (reason- 
able) policy we can think of 

Rut the state has interfered as well to make it more difficult for peo- 
ple with little intelligence to do that thing—find a compatihle partner 
and get married—that constitutes the most accessible and richest of all 
valued places. Marriage fills a vital role in people ’s lives to the extent 
that it is hallowed as an institution and as a relationship unlike any 
other. Marriage is satisfying to the extent that society validates these 
propositions: “Yes, you may have a baby outside marriage if you choose; 
hut it isn’t the same." “Yes, you may live with someone without marry- 
ing, hut it isn’t the same.“ “Yes, you may say that you are committed to 
someone without marrying, but it isn’t the same.“ 

Once sex was no longer playing as important a role in the decision to 
marry, it was essential that these other unique attributes of marriage be 
highlighted and reinforced. But the opposite has happened. Repeatedly, 
the prerogatives and responsibilities that used to be limited to marriage 
have spilled over into nonmarital relationships, whether it is the rights 
and responsibilities of an unmarried father, medical coverage for same- 
sex parrners, orpalimony cases. Once the lawsays, “Well, ina legalsense, 
living together is the same,” what is the point of getting married? 

For most people, there are still answers to that question. Even given 
the diminished legal stature of marriage, marriage continues to have 
unique value. But to see those values takes forethought about the long- 
term differences between living together and being married, sensitivity 
to many intangibles, and an appreciation of second-hand and third- 
hand consequences. As Chapter 8’s evidence about marriage rates im¬ 
plies, people low on the intelligence distriburion are less likely to think 
through those issues than others. 

Our policy prescription in this instance is to retum marriage to irs 
formerly unique legal Status. If you are married, you take on obligations. 
If you are not married, you don’t. In particular, we urge that marriage 
once again become the sole legal institution through which rights and 
responsibilities regarding children are exercised. If you are an unmar¬ 
ried mother, you have no legal basis for demanding that the father of 
the child provide support. If you are an unmarried father, you have no 
legal Standing regarding the child—not even a right to see the child, let 
alone any basis honored by society for claiming he or she is “yours” or 
that you are a “father.“ 
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We do not expect such changes miraculously to resuscitate marriage 
in the lowest cognitive classes, but they are a Step in the return to a sim- 
pler valuation of it. A family is unique and highiy desirable. To statt one, 
you have to get married. The role of the state in restoring the rewards 
of marriage is to validate once again the rewards that marriage naturaüy 
carries with it. 

More General Implications for Policy 

Crime and marriage are only examples of a general principle: Modern 
American society can be simplified. No law of nature says that the in- 
creasing complexity of technology must be matched by a new com- 
plexity in the way the nation is govemed. The increasing complexity of 
technology follows from the functions it serves. The increasing com- 
plexity of government does not. Often the complexities introduced by 
technology require highiy sophisticated anaty.sis before good law and reg- 
ulation can be developed. But as a rule of thumb, the more sophisticated 
the analysis, the simpler the policies can be. Policy is usually compli- 
cated because it has been built incrementally through a political process, 
not because it has needed to become more complicated. The time has 
come to make simplification a top priority in reforming policy—not for 
a handful of regulations but across the board. 

More broadly, we urge that it is possible once again to make a core of 
common law, combined with the original concepts of negligence and 
liability in tort law, the mechanism for running society—easily under- 
stood by all and a basis for the straightforward lessons that parents at 
all levels of cognitive ability above the lowest can teach rheir child- 
ren about how to behave as they grow up. We readily acknowledge 
that modernity requires some amplifications of this simple mechanism, 
but the nation needs to think through those amplifications from the 
legal äquivalent of zero-based budgeting. As matters stand, the legal 
edifice has become a labyrinth that only the rieh and the smart can 
navigate. 

BLANKS UNF1LLED 

We have presented what we believe needs to be done. We also under- 
stand that a common response will be incredulity, for different readers 
will Interpret the long chapters that have come before as a manifeste 
for completely different kindsofpolicy initiatives. Specifically, twolines 


of argument are likely to follow from this book. To some, we will have 
made a case for increased income redistribution. To others, we will have 
made a case for steps to manipulate the fertility of people with high 
and low IQs. We will be pleased if the book leads to a vigorous discus- 
sion of these issues, but we have just a few words to say about them here. 

Dealing with Income 

Ever since most people quit believing that a person s income on earth 
reflects God s judgment of his worth, it has been argued that income dis- 
tributions are inherently unfair; most wealthy people do not “deserve” 
their wealth nor the poor their poverty. That being the case, it is ap- 
propriate for societies to take from the rieb and give to the poor. The 
Statistical relationship we have documented between low cognitive abil¬ 
ity and income is more evidence that the world is not fair. 

Rut it is not news that the world is unfair. You knew before reading 
this book that income differences arise from many arbitrary causes, so- 
ciological and psychological, besides differences in intelligence. All of 
them are reflected in correlations of varying sizes, which mean all of 
them are riddled with exceptions. This complicates Solutions. When- 
ever individual cases are examined, differences in circumstances will be 
iound that du reflecr the individuals fault or merit. The data in this book 
support old arguments for supplementing the income of the poor with- 
out giving any new guidance for how to do it. 

The evidence about cognitive ability causes us to be sympathetic to 
the straightforward proposition that “trying hard” ought to be rewarded. 
Our prescription, horrowing from the case made by political scientist 
David Ellwood, is that people who work full time should not he too poor 
to have a decent Standard of living, even if the kinds of work they can 
do are not highiy valued in the marketplace. 2s We do not put this as a 
principle of government for all countries—getting everybody out of 
poverty is not an Option in most of the world—but it is appropriate for 
rieh countries to try to do. 

How? There is no economically perfect alternative. Any government 
Supplement of wages produc.es negative effects of many kinds. Such de- 
fects are not the results of bad policy design but inherent. The least dam- 
aging strategies are the simplest ones, which do not try to oversee or 
manipulate the labor market behavior of low-income people, but rather 
augment their earned income up to a floor. The earned income tax 
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credit, already in place, seems to be a generally good strategy, albeit with 
the unavoidable drawbacks of any income Supplement . 12 ^ 1 

We will not try to elaborate on rhese arguments here. We leave the 
income issue with this: As America enters the twenty-first Century, it is 
inconceivable that it will return to a laissez-faire System regarding in- 
come. Some sort of redistribution is here to stay. The question is how 
to redistribute in ways that increase the chances for people at the bot- 
tom of society to take control of their lives, to be engaged meaningfully 
in their communities, and to find valued places for themselves. Cash 
Supplements need not compete with that goal, whereas the social web 
fare System that the nation has developed in the twentieth Century most 
definitely does. We should be looking for ways to replace the latter with 
the formen 

Dealing with Demography 

Of all the uncomfortable topics we have explored, a pair of the most un- 
comfortable ones are that a society with a higher mean IQ is also likely 
to be a society with fewer social ills and brighter economic prospects, 
and that the most efficient way to raise the IQ of a society is for smarter 
women to have higher birth rares than duller women. Instead, Amer¬ 
ica is going in the opposite direction, and the implication is a future 
America with more social ills and gloomier economic prospects, These 
conclusions follow directly from the evidence we have presenred ar such 
length, and yet we have so far heen silent on what to do about it. 

We are silent partly because we are as apprehensive as most other 
people about what might happen when a government decides to social- 
engineer who has bahies and who doesn’t. We can imagine no recom- 
mendarion for using the government to manipulate fertility that does 
not have dangers. But this highlights the problem: The United Stares 
already has policies that inadvertently sociabengineer who has bahies, 
and it is encouraging the wrong women. If the United States did as mach 
to encourage high-IQ u’ornen to have bahies as it nou> does to encourage low- 
JQ warnen, it would rightly be descrihed as engaging in aggressive manipula- 
tion of fertility. The technically precise description of Americas fertility 
policy is that it subsidizes births among poor women, who are also dis- 
proportionately at the low end of the intelligence distribution. We urge 
generally that these policies, represented by the extensive network of 
cash and Services for low-income women who have bahies, be ended. 


The government should stop subsidizing births to anyone, rieh or poor. 
The other generic recommendation, as closc to harmless as any gov- 
ernment program we can imagine, is to make it easy for women to make 
good on their prior decision not to get pregnant by making available 
birth control mechanisms that are increasingly flexible, foolproof, in- 
expensive, and safe. 

The other demographic factor we discussed in Chapter 15 was im- 
migration and the evidence that recent waves of immigrants are, on the 
average, less successful and prohahly less able, than earlier waves. There 
is no reason to assume that the hazards associated with low cognitive 
ability in America are somehow circumvented hy having heen born 
abroad or having parents or grandparents who were. An Immigrant pop- 
ulation with low cognitive ability will—again, on the average—have 
trouble not only in finding good work but have trouble in school, at 
home, and w'ith the law. 

This is not the place, nor are we the people, to try to rewrite immi- 
gration law. But we believe that the main purpose of immigration law 
should be to serve Americas interests. It should be among the goals of 
public policy to shift the flow of immigrants away from those admitted 
under the nepotisric rules (which broadly encourage the reunification 
of relatives) and toward those admitted under competency rules, already 
established in immigration law—not to the total exclusion of nepotis- 
tic and humanitarian eriteria but a shift. Perhaps our central thought 
ahouv immigration is that present policy assumes an indifference to the 
individual eharacteristics of immigrants that no society can indefinitely 
maintain without danger. 

CON CLUS ION 

Hundreds of pages ago, in the Preface, we reflected on the question that 
we have heen asked so often, “What good can come from writing this 
hook?” We have tried to answer it in many ways. 

Our first answer has heen implicit, scattered in material throughout 
the hook. For thirty years, vast changes in American life have heen in- 
stituted by the federal government to deal with social prohlems. We 
have tried to point out what a small segment of the population accounts 
for such a large proportion of those prohlems. To the extent that the 
Problems of this small segment are susceptible to social-engineering so- 
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lutionsat all, they should be highly targeted.The vast majority of Amer- 
icans can run their own lives just fine, and policy should above all be 
constructed so that it permits them to do so. 

Our second answer, also implicit, has been that just about any policy 
in any area—education, employment, welfare, criminal justicc, or the 
care of children—can profit if its designers ask how the policy accords 
with the wide Variation in cognitive ability. Policies may fail not be- 
cause they are inherently flawed but because they do not inake ab 
lowances for how much people vary. There are hundreds of ways to frame 
bits and pieces of public policy so that they are based on a realistic ap- 
praisal of the responses they will get not from people who think like 
Rhodes scholars but people who think in simpler ways. 

Our third answer has gone to specific issues in raising the cognitive 
functioning of the disadvantaged (Chapter 17) and in improving edu- 
cation for all (Chapter 18). Part of our answer has been cautionary: 
Much of public policy toward the disadvantaged Starts from the prcmise 
that interventions can make up for genetic or environmental disad- 
vantages, and that premise is overly optimistic. Part of our answer has 
been positive: Much can and should be done to improve education, es- 
pecially for those who have the greatest potential. 

Our fourth answer has been that group differences in cognitive abil¬ 
ity, so desperately denied for so long, can best be handled—can oiüy be 
handled—by a retum to individualism. A person should not be judged 
as a member of a group but as an individual. With that comerstone of 
the American doctrine once again in place, group differences can take 
their appropriately insignificant place in affecting American life. Rut 
until that cornerstone is once again in place, the anger, the hurt, and 
the animosities will continue to grow. 

In this closing chapter, we have focused on another aspect of what 
makes America special. This most individualistic of nations contains 
one of the friendliest, most eager to oblige, neighborly peoples in all the 
world. Visitors to America from Tocqueville on down have observed it. 
As a by-product of this generosity and civic mindedness, America has 
had a genius for making valued places, for people of all kinds of abili- 
ties, given only that they played by a few basic rules. 

Once we as a nation absorbed people of different cultures, abilities, 
incomes, and temperaments into communities that worked. The nation 
was good at it precisely because of, not in spite of, the freedom that 


American individuals and communities enjoyed. Have there been ex- 
ceptions to that generalization? Yes, predominantly involving race, and 
the nation rightly moved to rid itself of the enforced discrimination that 
lay hehind those exceptions. Is the generalization nonetheless justified? 
Overwhelmingly so, in our judgment. Reducing that freedom has ener- 
vated our national genius for finding valued places for everyone; the ge¬ 
nius will not be revitalized until the freedom is restored. 

Cognitive partitioning will continue. It cannot be stopped, because 
the forces driving it cannot be stopped. But America can choose to pre- 
serve a society in which every Citizen has access to the central satisfac- 
tions of life. Its people can, through an interweaving of choice and 
responsibility, create valued places for themselves in their worlds. They 
can live in communities—urhan or rural—where being a good parent, 
a good neighbor, and a good friend will give their lives purpose and 
meaning. They can weave the most crucial safety nets together, so that 
their mistakes and misfortunes are mitigatecl and withstood with a lit- 
tle help from their friends. 

AU of these good things are available now to those who are smart 
enough or rieh enough—if they can exploir the complex rules to their 
advantage, buy their way out of the social institutions that no longer 
function, and have access to the rieh human interconnections that are 
growing, not diminishing, for the cognitive ly fortunate. We are ca Hing 
upon our readers, so heavily concentrated among those who fit that de- 
scription, to recognize the ways in which public policy has come to deny 
those good things to those who are not smart enough and rieh enough. 

At the heart of our thought is the quest for human dignity. The cen¬ 
tral me as ure of success for this government, as for any other, is to per- 
mir people to live lives of dignity—not to gwe them dignity, for that is 
not in any governments power, but to make it accessible to all. That is 
one way of thinking ahout what the Founders had in mind when they 
proclaimed, as a truth self-evident, that all men are created equal. That 
is what we have in mind when we talk about valued places for everyone. 

lnequality of endowments, including intelligence, is a reality. Trying 
to pretend that inequality does not real ly exist has led to disaster. Try¬ 
ing to eradicate inequality with artificially manufactured outcomes has 
led to disaster. It is time for America once again to try living with in¬ 
equality, as life is lived: understanding that each human being has 
strengths and weaknesses, qualities we admire and qualities we do not 
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admire, c.ompetencies and incompetences, assets and debits; that the 
success of each human life is not measured extemally hut internally; 
that of all the rewards we can confer on each other, the most precious 
is a place as a valued fellow Citizen. 


Appendix 1 

Statistics for People Who Are 
Sure They Can’t Learn Statistics 


The short explanations of Standard deviation (page 44), correlation 
(page 67), and regression (page 122) should be saiisfacrory for peoplc 
who are at home with math hut never took a statistics course. The longer 
explanations in this appendix are for peuple who would like to unden 
stand what distribution , Standard deviation, correlation, and regres.sion 
mean, hut who are not at home with math. 

DISTRIBUTIONS AND STANDARD DEVIATIONS 

Why Do We Need “Standard Deviation”? 

Every day, formally or informally, people make comparisons—among 
people, among apples and oranges, among dairy cows or eggdayinghens, 
among the screws heing coughed out hy a screw machine. The Standard 
deviation is a measure of how spread out the things being compared are. 
“This egg is a lot bigger than average,” a chicken farmcr might say. The 
Standard deviation is a way of saying precisely what “a lot” means. 

What Is a Frequency Distribution? 

To get a clear idea of what a frequency distribution is, imagine yourself 
back in your high school gym, with all the boys in the senior dass in the 
school gym assembled before you (including both sexes would complb 
cate matters, and the point of this discussion is to keep things simple). 
Line up these boys from left to right in order of height. 

Now you have a long line going from shortest to tallest. As you look 
along the line you will see that only few boys are conspicuously short 
and tall. Most are in the middle, and a lot of them scem identical in 
height. Is there any way to get a better idea of how this pattem looks? 



Tape a series of cards to the floor in a straight line from left to right, 
with 60 inches and shorter” written on the one at the far left, “80 inches 
and taller” on the card at the far right, and cards in kinch increments 
in between. Teil everyone to stand behind the card that corresponds to 
his height. 

Someone loops a rope over the rafters and pulls you up in the air so 
you can look straight down on the tops of the heads of your classmates 
Standing in their single files behind the height labels. The figure below 
shows what you see: a frequency distribution* 1 What good is it? Look 

The raw material of a frequency distribution 


****** •• ••• ••• 

***** •• •• ••••••• • 

60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 

ing at your high school classmates Standing around in a mob, you can 
teil very little about their height. Looking at those same classmates 
arranged into a frequency distribution, you can teil a lot, quickly and 
memorably. 

How Is the Distribution Related to the Standard Deviation1 

We still lack a convenient way of expressing where people are in that 
distribution. What does it mean to say that two different students are, 
say, 6 inches different in height. How “big” is a 6-inch difference? That 
brings us back to the Standard deviation. 

When it comes to high school students, you have a good idea of how 
big a 6-inch difference is. But what does a 6-inch difference mean if you 
are talking about the height of elephants? About the height of cats? It 
depends. And the things it depends on are the average height and how 
much height varies among the things you are measuring. A Standard de¬ 
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viation givesyou a way oftaking both the average and that variability into ac - 
count, so that “6 inches” can be expressed in a way that means the same thing 
for high school students relative to other high school students , elephants rela¬ 
tive to other elephants, and cats relative to other cats . 

How Do You Compute a Standard Deviation? 

Suppose that your high school dass consisted of just two people who 
were 66 inches and 70 inches* Obviously, the average is 68 inches. Just 
as obviously, one person is 2 inches shorter than average, one person is 
2 inches taller than average* The Standard deviation is a kind of aver¬ 
age of the differences from the mean—2 inches, in this example* Sup¬ 
pose you add two more people to the dass, one who is 64 inches and the 
other who is 72 inches* The mean hasn’t changed (the two new people 
balance each other off exactly)* But the newcomers are each 4 inches 
different from the average height of 68 inches, so the Standard devia¬ 
tion, which measures the spread, has gotten bigger as well. Now two 
people are 4 inches different from the average and two people are 2 
inches different from the average. That adds up to a total 12 inches, di- 
vided among four persons. The simple average of these differences from 
the mean is 3 inches (12-5-4), which is almost (but not quite) what the 
Standard deviation is. To be precise, the Standard deviation is calculated 
by squaring the deviations from the mean, then summing them, then 
finding their average, then taking the square root of the result* In this 
example, two people are 4 inches from the mean and two are 2 inches 
from the mean. The sum of the squared deviations is 40 (16 + 16 + 4 
+ 4). Their average is 10 (40 + 4)* And the square root of 10 is 3.16, 
which is the Standard deviation for this example* The technical reasons 
for using the Standard deviation instead of the simple average of the de¬ 
viations from the mean are not necessary to go into, except that, in nor¬ 
mal distributions, the Standard deviation has wonderfully convenient 
properties. If you are looking for a short, easy way to think of a Standard 
deviation, view it as the average difference from the mean* 

As an example of how a Standard deviation can be used to compare 
apples and oranges, suppose we are comparing the Olympic women’s 
gymnastics team and NBA basketball teams. You see a woman who is 5 
feet 6 inches and a man who is 7 feet. You know from watching gym¬ 
nastics on television that 5 feet 6 inches is tall for a woman gymnast, 
and 7 feet is tall even for a basketball player* But you want to do better 
than a general impression* Just how unusual is the woman, compared to 
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the average gymnast on che U.S. women’s team, and how unusual is the 
man, compared to the average basketball player on the U.S. mens team? 

We gather data on height among all the women gymnasts, and de- 
termine that the mean is 5 feet 1 inches with a Standard deviation (SD) 
of 2 inches. For the men basketball players, we find that the mean is 6 
feet 6 inches and the SD is 4 inches. Thus the woman who is 5 feet 6 
inches is 2.5 Standard deviations taller than the average; the 7Tont man 
is only 1.5 Standard deviations taller than the average. These numbers— 
2.5 for the woman and 1.5 for the man—are called Standard scores in 
Statistical jargon. Now we have an explicit numerical way ro compare 
how different the two people are from their respective averages, and we 
have a basis for concluding that the woman who is 5 feet 6 inches is a 
lot taller relative to other female Olympic gymnasts than a 7-foot man 
is relative to other NBA basketball players. 

How Much More Different? Enter the Noimal Distribution 

Even before coming to this book, most readers had heard the phrases 
normal distribution or belbshaped curve , or, as in our title, bell curve . They 
refer to a common way that natural phenomena arrange themselves ap- 
proximately. (The true normal distribution is a mathematical abstrac- 
tion, never perfectly observed in nature.) lf you look again at the 
distribution of high school boys that opened the discussion, you will 
see the makings of a bell curve. If we added several thousand more boys 
to it, the kinks and irregularities would smooth out, and it would actually 
get very close to a normal distribution. A perfect one is in the figure 
below. 



It makes sense that most things will be arranged in bell-shaped curves. 
Extremes tend to be rarer than the average. If that sounds like a rautob 
ogy, it is only because bell curves are so common. Consider height again. 
Seven feet is “extreme” for humans. But if human height were distrib- 
uted so that equal proportions of people were 5 feet, 6 feet, and 7 feet 
tall, the extreme would not be rarer than the average. It just so happens 
that the world hardly ever works that way, 

Bell curves (or close approximations to them) are not only common 
in nature; they have a close mathematical affinity to the meaning of the 
Standard deviation. In any true normal distribution, no matter whether 
the elements are the heights of basketball players, the diameters of screw 
heads, or the milk production of cows, 68.27 percent of all the cases fall 
in the interval between 1 Standard deviation above the mean and 1 stan- 
dard deviation below it. It is worth pausing a momenr over this link be- 
tween a relatively simple measure of spread in a distribution and the way 
things in everyday life vary, for it is one of nature’s more remarkable uni- 
formities. 

In its mathematical form, the normal distribution extends to infin- 
ity in both directions, never quite reaching the horizontal axis. But for 
practica! purposes, when we are talking about populations of people, a 
normal distribution is about 6 Standard deviations wide. The next fig- 
ure shows how the bell curve looks, cut up into six regions, each marked 

A bell curve cut into Standard deviations 



Standard deviations from the mean 


by a Standard deviation unit. The ränge within ± 3 Standard deviation 
units includes 99.7 percent of a population that is distrihuted normally. 

We can squeeze the axis and make it look narrow, or Stretch it out 
and make it look wide, as shown in the following figure. Appearances 
notwithstanding, the mathematical shape is not really changing. The 
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Standard deviations cut off the same portions of the population for 
any normal distribution 



Standard deviation continues to chop off proportionately the same size 
chunks of the distribution in each case. And therein lies its value. The 
Standard deviation has the same meaning no matter whether the dis- 
trihution is tall and skinny or short and wide. 

Furthermore, there are some simple characteristics about these scores 
that make them especially valuable. As you can see by looking at the 
figures above, it makes intuitive sense to think of a 1 Standard deviation 
difference as “large,” a 2 Standard deviation difference as “very large,” 
and a 3 Standard deviation difference as “huge.” This is an easy metric 
to remember. Specifically, a person who is 1 Standard deviation above 
the mean in IQ is at the 84th percentile. Two Standard deviations above 
the mean puts him at the 98th percentile. Three Standard deviations 
above the mean puts him at the 99.9th percentile. A person who is 1 
Standard deviation below the mean is at the 16th percentile. Two Stan¬ 
dard deviations below the mean puts him at the 2d percentile. Three 
Standard deviations below the mean puts him at the 0.1 th percentile. 

Why Not Just Öse Percentiles to Begin With? 

Why go to all the trouhle of computing Standard scores? Most people 
understand percentiles already. Teil them that someone is at the 84th 
percentile, and they know right away what you mean. Teil them that 
hes at the 99th percentile, and they know what that means. Aren’t we 
just mtroducing an unnecessary complication by talking about “Stan¬ 
dard scores”? 

Thinking in terms of percentiles is convenient and has its legitimate 


uses. We often speak in terms of percentiles—or eentiles—in the text. 
But they can also he highly misleading, hecause they are artificially com- 
pressed at the tails of the distributions. It ls a longer way from, say, the 
98th centile to the 99th than from the 50th to the 5Ist. In a true nor¬ 
mal distribution, the distance from the 99th centile to the lOOth (or, 
similarly, from the Ist to the Oth) is infinite. 

Consider two people who are at the 50th and 55rh eentiles in height. 
Using the NLSY as our estimate of the national American distribution 
of height, their aetual height difference is only half an inch. 121 Consider 
anorher rwo people who are at the 94th and 99th eentiles on height— 
the identical gap in terms of eentiles. Their height difference is 3.1 
inches, six times the height difference of those at the 50th and 55th 
eentiles. The further out on the tail of the distribution you move, the 
more misleading eentiles become. 

Standard scores reflect these real differences much more accurately 
than do eentiles. The people at the 50th and 5 5th eentiles, only half an 
inch apart in real height, have Standard scores ot 0 and .13. Compare 
that difference of .13 Standard deviation to the Standard scores of those 
at the 94th and 99th eentiles: 1.55 and 2.33, respectively. In Standard 
scores, their difference—which is .78 Standard deviation—is six times 
as large, reflecting the six-fold difference in inches. 

The same logic applies to intelligence test scores, and it explains why 
they should be analyzed in terms of Standard scores, not eentiles. There 
is a lot of difference between people at the Ist centile and the 5th, or 
between those at the 95th and the 99th, much more than those at the 
48th and the 52d. If you doubt this, ask a university teacher to compare 
the classroom performance of students with an SAT-Verbal of 600 and 
those with an SAT-Verbal of 800. Both are in the 99th centile of all 18- 
year-olds—but what a difference in verbal ability! 1 * 1 

CORRELATION AND REGRESSION 

We now need to consider dealing with the relationships between two or 
more distributions—which is, after all, what scientists usually want to 
do. How, for example, is the temperature of a gas related to its volume? 
The answer is Boyle s Law, which you learned in high school science. In 
social science, the relationships between variables are less clear cut and 
harder to unearth. We may, for example, be interested in wealth as a vari¬ 
able, but how shall wealth be measured? Yearly income? Yearly income 



560 Appendix I 


App endix l 561 


averaged over a period of years? The value of ones savings or possessions? 
And wealth, compared to many of thc other things social Science vvould 
fike to understand, is easy, reducihle as it is to dollars and cents. 

Rur beyond the problem of measurement, social Science must cope 
with sheer complexity. Our physical scientist colleagues may not agree, 
but we believe it is harder to do seience on human affairs rhan on inan- 
imate objects—so hard, in fact, that many people consider it impossi- 
ble. We do not believe it is impossible, but it is rare that any human or 
social relationship can be fully captured in terms of a single pair of vari¬ 
ables, such as that between the temperature and volume of a gas. ln so¬ 
cial Science, multiple relationships are the rule, not the exception. 

For both of these reasons, the relations between social Science vari¬ 
ables are typically less than perfect. They are often weak and uncertain. 
Rut they are nevertheless real, and, with the right methods, they can he 
rigorously examined. 

Correlation and regression, used so often in the text, are the primary 
ways to quantify weak, uncertain relationships. For that reason, the ad- 
vances in correlational and regression analysis since the late nineteenth 
Century have provided the impetus to social Science. To understand 
what this kind of analysis is, we need to introduce the idea of a scatter 
diagram. 

Scatter Diagrarns 

We left your male high school classmates lined up hy height, with you 
looking down from the rafters. Now imagine another row of cards, laid 
out along the floor at a right angle to the ones for height. This set of 
cards has weights in pounds on them. Start with 90 pounds for the dass 
shrimp, and in 10-pound increments, continue to add cards until you 
reach 250 pounds to make room for the dass giant . Now ask your class- 
mares to find the point on the floor that corresponds to both rheir height 
and weighr (perhaps they’ll insist on a grid of intersecting lines ex- 
tending from rhe two rows of cards). When the traffic on the gym floor 
ceases, you will see something like the figure below. This is a scatter di¬ 
agram. Some sorr of relationship between height and weight is imme- 
diately obvious. The heaviest boys tend to he the tallest, the lighrest 
ones the shortest, and most of them are intermediate in both height and 
weighr. Equally obvious are the deviarions from the trend that link 
height and weight. The stocky boys appear as points ahove the mass, 


A scatter diagram 
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the skinny ones as points below it. What we need now is some way to 
quantify both the trend and the exceptions. 

Correlations and regressions accomplish. this in different ways. But be- 
fore we go on to discuss these terms, be reassured that they are simple. 
Look at the scatter diagram. You can see by the dots that as height in- 
creases, so does weight, in an irregulär way. Take a pencil (literally or 
imaginarily) and draw a straight, sloping line through the dots in a way 
that seems to you to best reflect this upward-sloping trend. Now con¬ 
tinue to read, and see how well you have intuitively produced the result 
of a correlation coefficient and a regression coefficient. 

The Correlation Coefficient 

Modern Statistics provides more than one method for measuring corre- 
lation, but we confine ourselves to the one that is most important in 
both use and generality: the Pearson product-moment correlation coet- 
ficient (named after Karl Pearson, the English mathematician and bio- 
metrician). To get at this coefficient, let us first replot the graph of the 
dass, replacing inches and pounds with Standard scores. The variables 
are now expressed in general terms. Remember: Any set of measure- 
ments can be transformed similarly. 
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The next srep on our way to the correlation coefficienr is to apply a 
formula (here dispensed wich) that, in effect, finds the best possihle 
straight line passing through the cloud of points—the mathematically 
“best” Version ot the line you just drew by inruition, 

What makes it the “best”? Any line is going to be “wrong” for most 
of the points. For example, look at the weights of the boys who are 64 
inches tall. Any sloping straight line is going to cross somewhere in the 
middle of those weights and may not cross any of the dots exactly. For 
boys 64 inches tall, you want the line to cross at the point where the to¬ 
tal amount of the error is as small as possihle. Taken over all the boys at 
all the heights, you want a straight line that makes the sutn of all the 
errors for all the heights as small as possihle. This “best fit” is shown in 
the new version of the scatter diagram below, where both height and 
weight are expressed in Standard scores and the mathematical hest-fii- 
ting line has been superimposed. 

The “best-fit” line for a scatter diagram 


Weight, expressed in Standard scores 
4- 
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Height, expressed in Standard scores 


This scatter diagram has (partly by serendipity) many lessons to teach 
about how staristics relate to the real world. Here are a fevv of the main 
ones: 

1. Notice the many exceptions. There is a statisrically substantial rela- 
tionship between height and weight, but, visually, the exceptions 
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seem todominate. So too with virtually all Statistical relationships 
in the social Sciences, most of which are much weaker than this 
one. 

2. Linear relationships dont always seem to fit very well. The best-fit 
line looks as if it is too shallow. Look at the tall boys, and see how 
consistently it underpredicts how much they weigh. Given the in- 
formation in the diagram, this mightbe an optical illusion—many 
of the dots in the dense part of the ränge are on top of each other, 
as it were, and thus it is impossihle to grasp visually how the er- 
rors are adding up—but it could also be that the relationship be- 
tween height and weight is not linear. 

3. Small samples have hulwidual anornalies . Betöre we jump to the con- 
clusion that the straight line is not a good representation of the 
relationship, remember that the sample consists of only 250 boys. 
An anomaly of this particular small sample is that one of the boys 
in the sample of 250 weighed 250 pounds. Eighteen-year-old 
boys are very rare ly that heavy, judging from the entire NLSY sam¬ 
ple, fewer than one per 1,000. And yet one of those rarities hap- 
pened to be picked up in a sample of 250. That’s the way samples 
work. 

4. But small samples are also surfmsingly accurate , despite their individ¬ 
ual anornalies. The relationship between height and weight shown 
by the sample of 250 18-year-old males is identical to the third 
decimal place with the relationship among all 6,068 males in the 
NLSY sample.* 41 This is closer than we have any right to expect, 
but other random samples of only 250 generally produce correla- 
tions that are within a few hundredths of the one produced by the 
larger sample. (There are mathematics for figuring out what “gen¬ 
erally” and “within a few hundredths” mean, but we needn’t worry 
about them here.) 

Hearing these basics in mind, let us go back to the sloping line in the 
figure above. Out of mathematical necessity, we know several things 
about it. First, it must pass through the interscction of the zeros (which, 
in Standard scores, correspond to the averages) for both height and 
weight. Second, the line would have had exactly the same slope had 
height been the vertical axis and weight the horizontal one. Finally, and 
most significant, the slope of the best-fitting line cannot be steeper than 
1.0. The steepest possihle best-fitting line, in other words, is one along 
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which one unit of change in height is exactly matched hy onc unir of 
change in weight, clearly not the case in these data. Real data in the so¬ 
cial Sciences never yield a slope that steep. 

ln the picture, the line goes uphill to the right, hut for other pairs of 
variables, it could go downhill. Consider a scatter diagram for, say, 
educational level and fertility by the age of .30. Women vvith more 
education tend to have fewer habies when they are young, eompared to 
women with less education, as we discuss in Chapters 8 and 15. The 
cloud of points would decline from left to right, just the reverse of the 
cloud in the picture above. The downhill slope of the best-fitting line 
would be expressed as a negative number, but, again, it could he no 
steeper than -1.0. 

We focus on the slope of the best-fitting line because it is the corre- 
lation coefficient—in this case, equal to .50, which is quite large by the 
Standards of variables used by social scientists. The closer it gets to ± 1.0, 
the strenger is the linear relationship between the Standardized vari¬ 
ables (the variables expressed as Standard scores). When the two vari¬ 
ables are mutually independent, the best-fitting line is horizontal; hence 
its slope is 0. Anything other than 0 signifies a relationship, albeit pos- 
sibly a very weak one. 

Whatever the correlation coefficient of a pair of variables is, squar- 
ing it yields another notable number. Squaring .50, for example, gives 
.25. The significance of the squared correlation is that it teils how mach 
the Variation in weight would decrease if we could make everyone the 
same height, or vice versa. If all the boys in the dass vvere the same 
height, the Variation in their weights would decline by 25 percent. Per- 
haps, if you have been compelled to be around social scientists, you have 
heard the phrase “explains the variance," as in, for example, “Education 
explains 20 percent of the variance in income.” That figure comes from 
the squared correlation. 

ln general , rhe squared correlation is a measure of the mutual reduiv 
dancy in a pair of variables. If they are highly correlated, they are highly 
redundant in the sense that knowing the value of one of them places a 
narrow ränge of possibilities for the value of the other. If they are uiv 
correlated or only slightly correlated, knowing the value of one teils us 
nothing or little about the value of the other.* 51 


Regression C oefficients 

Correlation assesses the strength of a relationship hetween variables. 
But we may want to know more about a relationship than merely its 
strength. We may want to know what it is. We may want to know how 
much of an increase in weight, for example, we should anticipare if we 
compare 66-inch boys with 7 3-inch boys. Such questions arise naturally 
if we are trying to explain a particular variable (e.g., annual income) in 
terms of the effects of another variable (e.g., educational level). How 
much income is another year of schooling worth ! is just the sort of ques- 
tion that social scientists are always trying to answer. 

The Standard merhod for answering it is regression analysis, which 
has an inrimate mathematical association with correlational analysis. If 
we had left the scatter diagram with its original axes—inches and 
pounds—instead of standardizing them, the slope of the best-fitting line 
would have been a regression coefficient, rather than a correlation co- 
effieieni. The figure below shows the scatter diagram with nonstan- 
dardized axes. 


What a regression coefficient is telling you 


Weight in pounds 

2^0 _ How much does height 
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Why are there two lines? Recall that the bestYitting line is the one 
that minimizes che aggregated distances between the data points and 
the line. For Standardized measurements, it makes no differenee whether 
the distances are measured along the pounds axis or the inches axis; for 
unstandardized measurements, it may make a differenee. Hence \ve may 
get two lines, depending on which axis was used to fit the line. The two 
lines, which always intersect at the average values for the two variables, 
answer different questions. One answers the question we first posed: 
How much of a differenee in pounds is associated with a given differ- 
ence in inches (i.e., the regression of weight on height). The other one 
teils us how much of a differenee in inches is associated with a given dif- 
ference in pounds (i.e., the regression of height on weight). 

Multiple Regression 

Multiple regression analysis is the main way that social science deals 
with the multiple relationships that are the rule in social science. To get 
a fix on multiple regression, let us return to the high school gym for the 
last time. Your classmates are still scattered about the floor. Now imag- 
ine a pole, ereeted at the intersection of 60 inches and 90 pounds, 
marked in inches frorn 18 inches to 50 inches. Forsome inscrutable rea- 
son, you would like to know the impact of both height and weight on a 
boys waist size. Since imagination can defy gravity, you ask each boy to 
levitate until the soles of bis shoes are at the elevation that reads on the 
pole at the waist size of his trousers. In general, the raIler and heavier 
boys must rise the most, the shorter and slighter ones the least, and most 
boys, middling in height and weight, will have middling waist sizes as 
well. Multiple regression is a mathematical procedure for finding that 
plane, slicing through the space in the gym, that minimizes the aggre- 
gated distances (in this instance, along the waist size axis) between the 
bottoms ot the boys’ shoes and the plane. 

The bestTitting plane will tilt upward toward heavy weights and tall 
heights. But it may tilt more along the pounds axis than along the inches 
axis, or vice versa. It may tilt equally for each. The slope of the tilt along 
each of these axes is again a regression coefficient. With two variables 
predicting a third, as in this example, there are two coefficients. One of 
them teils us how much of an increase in trouser waist size is associated 
with a given increase in weight, holding height constant; the other, how 


much of an increase in trouser waist size is associated with a given in' 
crease in height, holding weight constant . 

With two variables predicting a third, we reach the limit of visual 
imagination. But the principle of multiple regression can he extended 
to any number of variables. Income, for example, may be related not just 
to education but also to age, f'aimly Background, IQ, Personality, husi- 
ness conditions, region of the counrry, and so on. The mathematical 
procedures will yield coefficients for each of them, indicaring again how 
much of a change in income can he anticipated for a given change in 
any particular variable, with all the orhers hold constant. 

Logis tic Regression 

The text frequently resorts to a method of analysis ealled logistic regres- 
sion. Here, we need only say whar the method is for rather than what it 
is. Many of the variables we discuss are such rhings as heing unemployed 
or not, being married or not, heing a patent ot not, and so on. Because 
they are measured in two values—corresponding to yes and no—they 
are ealled binary variables. Logistic regression is an adaptation ot ordi- 
nary regression analysis tailored to the case of binary variables. (It can 
also be used for variables with larger numhers of discrere values.) h teils 
us how much change there is in the probability ot being unemployed, 
married, and so forth, given a unir change in any given variable, Hold¬ 
ing all other variables in the analysis constant. 


Appendix 2 


Technical Issues Regarding the 
National Longitudinal Survey of 

Youth 


This appendix provides details about the variables used in the text and 
abont other technical issues associated with the NLSY. 1 Colleagues who 
wish to recreate analyses will need additional information, which may 
be obtained from the authors. 2 


SURVEY YEAR, CONSTANT DOLLARS, AND SAMPLE WEIGHTS 

Our use of the NLSY extends through the 1990 survey year. 1 ' 1 

All dollar figures are expressed in 1990 dollars, using the consumer 
price index inflators as reported in the 1992 edition of Statistical Abstract 
of the United Sattes, Table 737. 

Sample weights were employed in all analyses in the main text. We 
do not so note in each instance, to simplify the description. ln Com¬ 
puting scores that were based on the 11,878 subjects who had valid 
scores on the Armed Forces Qualification Test (AFQT), we used the 
sampling weights specifically assigned for the AFQT population. For 
analyses based on the NLSY subjects’ Status as of a given year (usually 
1990), we used the sampling weights for that survey year. For analyses 
in which the children of NLSY women were the unit of analysis, the 
child’s sampling weights were used rather than the mother’s. 

To make Interpretation of the Statistical significance easier, we repli- 
cated all the analyses in Part II using just the unweighted cross-sectional 
sample of whites, as reported in Appendix 4. 
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SCORING OF THE ARMED FORCES QUALIFICATION TEST 
(AFQT) 

The AFQT is a comhination of highly g-loaded suhiests trom the ArmeJ 
Services Vocational Aptitude Battery (ASVAB) thar serves as rhe 
armed Services’ measure of cognitive ability, described in detail in Ap¬ 
pendix 3. Until 1989, the AFQT consisted the summed raw scores of 
the ASVAB’s arithmetic reasoning, word knowledge, and paragraph 
comprehension suhtests, plus half of the score on numerieal operations 
subtest. In 1989, the armed forces decided to rescore the AFQT so thar 
it consisted of the word knowledge, paragraph comprehension, aritlv 
metic reasoning, and mathematics knowledge suhtests. The reason Kä¬ 
the change was to avoid the numerieal operations subtest, which was 
hoth less high ly g-loaded than the mathematics knowledge subtest and 
sensitive to small discrepancies in rhe time given to subjects when ad- 
ministering the test (numerieal operations is a speeded test in which the 
subject compl et es as many arithmetic Problems as possible within a time 
limit). 

A draft of The Bell Curie was well underway when we became aware 
of the 1989 scoring scheme. We completed a full draft using the 1980 
scoring System but decided that the revised scoring System was psycho- 
metrically Superior to the old one and therefore replicated all of the 
analyses using rhe 1989 Version. 

Scholars who wish to replicate our analyses should note that the 1989 
AFQT score as reported in the NLSY database is not the one used in the 
text. The NLSY’s variable is rounded to rhe nearest whole centile and 
based on the 18- to 2 3-year-old subset of the NLSY sample. We recom- 
puted the AFQT from scrateh using rhe raw subtest scores, and the pop- 
ulatitm mean and Standard deviation used in producing the across-ages 
AFQT score was based on all 1 1,878 subjects, not just rhose ages 18 to 
23. Hl This measure is useful for multivariate analyses in which age is also 
entered as an independent variable but should not he used (and is never 
ustd in the text) as a representation of an individual subject’s cognitive 
abihty because of age-related differences in test scores (see discussion 
below). 

Age 

AFQT scores in the NLSY sample rose by an average of .07 Standard de- 
viattons per year. The simplesr explanation for this is that the AFQT 


was designed by the military for a population of recruits who would he 
taking the test in their lare reens, and younger subjects in the NLSY 
sample got lower scores for the same reason that high school freshtnen 
get lower SAT scores than high school seniors. However, a cohort ef- 
fect eould also he at work, whereby (because of educational or broad en¬ 
vironmental reasons) youths born in the first half of the 1960s had lower 
realized cognitive ability than youths born in the last half of the 1950s. 
The re is no empirical way of telling which reason really explains the 
age-related differences in the AFQT or what the mix of reasons tnight 
he. The age-related increase is not perfectly linear (it levels off in the 
top two years) but close enough that the age problem is best handled in 
the multivariate analyses by entering the subject’s birthdate as an inde¬ 
pendent variable (all the NLSY sample took the AFQT within a few 
months of each other in late 1980), 

For all analyses except the multivariate regression analyses, we use 
age-equated scores. These were produced by using the sample weight as 
a frequency, then preparing separate distributions by birth year, ex- 
pressed in centiles. N Each subject’s rank in that population (mathe- 
matically, the “population” is the sum of the sample weights for that 
birth year) was divided by the population to obtain the centile where 
thar subject feil within his birth year cohort. 161 

That AFQT scores vary according to education raises an additional 
issue: To what extent is the AFQT a measure of cognitive ability, and 
not just length and quality of education? We explore this issue at length 
in Appendix 3. 

Skew 

The distribution of the AFQT in either of its versions is skewed so that 
the high scores tend to be more closely bunehed than the low scores. 
To put it roughly, the most intelligent people who take the test have less 
of an opportunity to get a high score than the least intelligent people 
have to get a low score. One effect is to limit artificially the maximum 
size of a Standardized score. It is artificial because the AFQT does in fact 
discriminate reasonably well at the high end of the scale. For example, 
only 22 youths out of 11,878 in the NLSY with valid AFQT scores 
earned perfect scores on the suhtests, representing 0.253 percent of the 
national population of their age (using sampling weights). In a test with 
a normal distribution, those youths would have had a standardized score 






572 Appendix 2 


Appendix 2 573 


of 2.80. But given the skew in the NLSY, it is impossihle for anyone to 
have a standardized score higher than 1.66. The Standard deviation for 
a high-scoring group is similarly squeezed. 

A certain amount of skew is not a concem for many kinds of analy- 
sis. For the analyses in The Bell Curve, however, the difference hetween 
two groups is otten expressed in terms of Standard deviations, and the 
size of that difference was likely to he affected by skew. 

We therefore computed standardized scores corrected for skew, first 
by computing the centile scores for the NLSY population, using sample 
weights as always, then assigning to each suhject the standardized score 
corresponding to that centile in a normal distribution. We did this for 
both the old and new versions of the AFQT. Following armed forces’ 
convention, all scores greater or smaller than 3 Standard deviations from 
the mean were set at 3 Standard deviations (this affected only a small 
number of scores at the low end of the distribution). 

The effects of correcting for skew were noticeable when expressing 
differences hetween groups. For example, for the most sensitive group 
comparison, hetween ethnic groups, the results are shown in the fob 
lowing table. As always when full information about means, Standard 


Comparison of Two Versions of The AFQT, 
Uncorrected and Corrected for Skew 

Black/ Latino/ 

Version of Corrected White Dif-White Dif- 


rhe AFQT for Skew? 

Black 

Latino 

White 


ference 

ferenci 


Mean 

SD 

Mean 

SD 

Mean 

SD 



IV-1989 No 

-.97 

.91 

-.67 

1.01 

.24 

.88 

1.36 

1.02 

Yes 

-.90 

.81 

-.64 

.93 

.23 

.92 

1.25 

.94 

1989 revision No 

-.93 

.87 

-.67 

.98 

.23 

.90 

1.30 

.99 

Yes 

-.88 

.83 

-.64 

.94 

.22 

.92 

1.21 

.93 


deviations, and sample sizes is available, the group differences are com- 
puted using the weighted average of the groups’ Standard deviations. 
The equation is given in note 25 for Chapter 13.The primary effect of 
the skew was to squeeze the Standard deviation of the higher-sconng 
group (whires) and, in comparison, elongate the Standard deviation of 
the lower scoring groups. Correcting for skew thus shrank both the 
black-white and Latino-white differences. The same phenomenon ah 
fected all comparisons involving subgroups with markedly different 


AFQT means. All standardized AFQT scores, for both the regression 
analyses and the age-equated scores, are therefore corrected for skew. ln 
other words, each represents the standardized score in a normal distri- 
bution that corresponds to the (unrounded) centile score of the suhject 
in the observed distribution. 

The effects of the different scoring methods on ethnic differences 
raise a larger question that we should answer directly: How would the 
results presented in this book he different if we had used the 1980 ver- 
sion of the AFQT instead of the 1989 version? If we had not corrected 
for skew instead of correcting for skew? For most analyses, the answer is 
that the results are unaffected. But it may also be said that whenever 


Why Not Just Use Centiles? 


One way of avoiding the skew prob lern is to leave the AFQT scores in een- 
tiles. This was unsatisfactory, however, for we knew from eollateral data 
that much of the important role of IQ occurs at the tails of the distribu- 
tion. Using centiles throws away information about the tails. (See Ap¬ 
pendix 1 on the normal distribution.) 


differences were found, the scoring procedure we used tended to pro- 
duce smaller relationships hetween IQ and the indicators, and smaller 
ethnic differences, than the alternatives. We did not compute every 
analysis hy each of the four scoring permutations, but we did replicate 
all of the analyses using the two extremes (1980 Version uncorrected for 
skew and the 1989 Version corrected for skew). In no instance did the 
1989 version corrected for skew—the version reported in the text 
yield significant findings that were not also found when using the 1980 
uncorrected version. ln terms of the relationships explored in this book, 
the 1989 version corrected for skew is the most conservative of rhe al¬ 
ternatives. 

THE SOCIOECONOM1C STATUS INDEX 

The SES index was created with the variables that are common ly used 
in developing measures of socioeconomic Status: education, income, 
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and occupation. Since the purpose of the index was to measure the so- 
cioeconomic environment in which the NLSY youth was raised, the 
specific variables employed referred to the parents’ Status: total net tarn- 
ily income, mother s education, father’s education, and an index of oc- 
cupational Status of the adults living with the subject at the age of 14. 
The population for the computation was limited to the 1 1,878 NLSY 
subjects with valid AFQT scores. ln more detail: 

Mothers education and father’s education were based on years of ed¬ 
ucation, converted to standardized scores. 

Family income was based on the averaged total net family income for 
1978 and 1979, in constant dollars, when figures for both years were 
available. If income for only one of the two years was reported, that year 
was used. Family income was excluded if the subject was a Schedule O 
interne wee (the reported income for the year in question referred to his 
or her own income, not to the parental household s income). The Jol- 
lar figure was expressed as a logarithm before being standardized. This 
procedure, customary when working with income data, has the effecr of 
discounting extremely high values of income and permitting greater dis- 
crimination among lower incomes. A minimum standardized value of 
—4 was set for incomes of less than $1,000 (all figures are in 1990 
dollars). 

Parental occupation was coded with a modified Version of the l)un- 
can socioeconomic index, grouping the Duncan values (which go trom 
1 to 100) into deciles. A value of T was assigned to persons out of the 
labor force altogether. It was assumed that the familys socioeconomic 
Status is predominantly determined by the higher of the two occupa- 
tions held by two parents. Thus the occupational variable was based on 
the higher of the two ratings of the two parents. The increment in so¬ 
cioeconomic Status represented by both parents holding high-sratus oc- 
cupations is indirectly reflected in the higher income and in the two 
educational variables. The eleven values in the modified Duncan scale 
were standardized. 

The reliability of the four-indicator index (Cronhach’s a) is .76. The 
correlations among the components of the index are shown in the tahle. 
The four variables were summed and averaged. If only a suhset of vari¬ 
ables had valid scores, that subset was summed and averaged. By far the 
most common missing variable was family income, since many of the 
NLSY youths were already living in independent households as of 


Correlations of Indicators in the 

Socioeconomic Status Index 

Mother’s 

Father’s 

Parental 

Education 

Education 

Occupation 

Fathers education .6 $ 

— 


Parental occupation .47 

.55 

— 

Family income .36 

.40 

.47 


the beginning of the survey, and hence were reporting their own 
income, not parental income. Overall, data were available on all four 
indicators for 7,447 subjects, for three on an additional 3,612, on two 
for 679, and on one for 1 38. Two subjects with valid scores on the AFQT 
had no Information available on any of the four indicators. For use in 
the regression analyses, the SES index scores were set to a mean of 0 
and a Standard deviation of 1. 

EDUCATIONAL ATTAINMENT 

Highest Urade Completed. 

The NLSY creates a variable each year for “highcst grade completed;’ 
incorporating information trom several questions.' For analyses based 
on the oceurrence of an event (e.g., the hirth ot a child), the value of 
“highcst grade completed“ for the contemporaneous survey year is used. 
For all other analyses, the 1990 value for “highest grade completed is 
used. Values run trom 0 through 20. 

Highest Degree Ever Received 

ln the 1988-1990 surveys, the NLSY asked respondents to report the 
highest degree they had ever received. The possible responses were: high 
school diploma, associate degree, bachelor of arts, bachelor of Science, 
masters, Ph.D., Professional degree (law, medicine, dentistiy), and 
“oLher.“ These self-reported degrees were sonne times questionable, es- 
pecially when the degree did not correspond ro the numher of years of 
education (e.g., a bachelor’s degree for someone who also reported only 
fourteen years of education). To climinate the most egregiously suspi- 
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cious cases, we in ade adjustments. For those who reported their highes t 
degree as being a high school diploma, we required at least eleven re- 
ported years of completed education. For degrees beyond the high school 
diploma, we required that the report of the highest grade completed he 
within at least one year of the normal number of years required to ob- 
tain that degree. Specifically, the minimum number of years of com- 
pleted years of education required to use a reported degree were thirteen 
for the Associated degree, fifteen for a bachelor’s degree, sixteen for a 
master s degree, and 18 for a Ph.D., law degree, or medical degree. 

We also employed the NLSY’s variables to discriminate between 
those whose terminal degree was a high school diploma versus a GED. 
We excluded the 190 persons whose degree was listed as “other,” alter 
trying fruitlessly to come up with a satisfactory means of estimating what 
the “other” meant frorn collateral educational data. 

The “high school” and “college graduate” samples used rhronghout 
Part II are designed to isolate populations with homogeneous educa¬ 
tional experiences as of the 1990 survey year. The high school sample 
is defined as those who reported twelve years of completed education 
and a high school diploma received through the normal process (i.e., 
excluding GEDs) as the highest attained degree. The College graduate 
sample is defined as all those who reported sixteen years of completed 
education and a B.A. or B.S. as the highest attained degree. 

Transition to College 

In Chapter 1, we used the NLSY to determine the percentage of stu- 
dents in various IQ groupings who went directly to College. We limited 
the analysis to students who obtained a high school diploma between 
January 1980 and July 1982, meaning that all subjects had taken the 
AFQT prior to attending College. The analysis thus also reflecrs the ex- 
per ie nee of those who ob tain their high school diploma via the normal 
route (comparable to the analyses from the 1960s and 1920s, which 
are also reported in the same figure). A suhject is classified as attending 
College in the year following graduation if he reported having enrolled 
in College at any point in the calendar year following the date of 
graduation. 


MARITAL AND FERTILITY VARIABLES 

All variables relating to marital history and childbearing employed the 
NLSY’s synthesis as contained in the 1990 Fertility File of the NLSY. 

BIRTH WEIGHT 

The most commonly reported measure of a problernatic hirth weight is 
“low birth weight,” defined as no more than 5.5 pounds. In its raw.form, 
however, low birth weight is limited as a measure becau.se ir is con- 
founded with prematurity. A baby born five weeks prematurely will 
probably weigh less than 5.5 pounds and yer be a fully developed, 
healthy child for gestational age, with excellent prospects. Conversely, 
a child carried to renn but weighing slightly more than the cutoff of 5.5 
pounds is (given parents of average stature) small for its gestational age. 
We therefore created a variable expressing the baby’s birth weight as a 
ratio of the weight for fetuses at the 50th centile for that gestational age, 
using the Colorado Intrauterine Growth Charts as the hasis for the com- 
putation. If a baby weighed less than 5.5 pounds but the ratio was equal 
to or greater than 1, that case was excluded from the analysis. All uses 
of this variable in Chapters 10 and 13 are based on a sample that is ex- 
clusively white (Latino or non-Latino) or black, therehy sidestepping 
the complications that would be introduced by the populations of 
smaller stature, such as East Asians. We further excluded cases report- 
ing gestational ages of less than twenty-six weeks, reports of pregnan- 
cies that lasted more than forty-four weeks or hirth weights in excess of 
thirteen pounds, and one remarkahle case in which a mother reported 
gestation of twenty-six weeks and a birth weight of more than twelve 
pounds. 



Appendix 3 


Technical Issues Regarding the 
Armed Forces Qualification Test 
as a Measure of IQ 


Throughout The BellCurve, we use the Armed Forces Qualification Test 
(AFQT) as a measure of IQ. This appendix discusses a variety of related 
issues that may help readers Interpret the meaning of the analyses pre- 
sented in the full text. 

DOES THE AFQT MEASURE THE SAME THING THAT IQ TESTS 
MEASURE? 

The AFQT is a paper-and-pencil test designed for youths who have 
reached their late teens. ln effect, it assumes exposure to an ordinary 
high school education (or the opportunity to get one). This kind of re- 
striction is shared hy any IQ test, all of which are designed for certain 
groups. 

The AFQT as scored hy the armed forces is not age referenced. The 
armed forces have no need to do so, because the overwhelming major- 
ity of recruits raking the test are 18 and 19 years old. ln contrast, the 
NLSY sample varied from 14 to 23 years old when they took the test. 
Therefore, as discussed in Appendix 3, all analyses in the book take age 
into account through one of two methods: entering age as an indepen¬ 
dent variable in the multivariate analyses, and, for all descriptive sta- 
tistics, age referencing the AFQT score by expressing it in terms of the 
mean and Standard deviation for each years birth cohort. ln this ap¬ 
pendix, we will uniformly use the age-referenced Version for analyses 
based on the NLSY. 

Is a set of age-referenced AFQT scores appropriately treated as IQ 
scores? We approach this issue from two perspectives. First, we examine 
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the internal psychometric properties of the AFQT and sKow that the 
AFQT is one of the most highly gdoaded mental tests in current use. It 
seems to tlo what a good IQ test is supposed to do—tap into a general 
factor rather than specific hits of leaming or skill—as well as or beLter 
than its competitors. Second, we examine rhe correlation between the 
AFQT and other IQ tests, and show that the AFQT is more high ly cor- 
related with a wide ränge of other mental tests than those other inen' 
tal tests are with each other. On both counts, the AFQT qualifies not 
just as an IQ test, hut one of the better ones psychometrically. 

Psychometric Characteristics of the ASVAB 

Let us hegin by considering the larger test from which the AFQT is com' 
puted, the ASVAB (Armed Services Vocational Aptitude Battery), 
taken every year hy between a half million and a million young adults 
who are applying for entry into one of the armed Services. The ASVAB 
has ten subtests, spanning a ränge from test items that could appear 
equally well on Standard tests of intelligence to items testing knowledge 
ot automobile repair and electronics. 111 Scores on the subtests determine 
whether the applicant will be accepted by his chosen branch of service; 
for those accepted, the scores are later used for the placement of enlisted 
personnel into military occupations. How well or poorly a person per' 
forrns in military occupational Training schools, and also how well he 
does on the job, can therefore be evaluated against the scores earned on 
a battery of Standardized tests. 

The ten subtests of ASVAB can be paired off into forty-five corre- 
lations. Of the forty-five, the three highest correlations in a large study 
of enlisted personnel were between Word Knowledge and General Sen 
ence, Word Knowledge and Paragraph Completion, and, highest of al l, 
between Mathematics Knowledge and Arithmetic Reasoning," Gorre- 
lations above .8, as these were, are in the ränge observed between dif¬ 
ferent IQ tests, which are frankly constructed to measure the same 
atrrihute. To see them arising between tests of such different subject 
matter should alert us to some deeper level of mental functioning. The 
three lowest correlations, none lower than .22, were between Coding 
Speed and Mechanical Comprehension, Numerical Operations and 
Auto/Shop Information, and, lowest of all, between Coding Speed and 
Automobile/Shop Information. Between those extremes, there were 
rather large correlations between Paragraph Completion and General 


Science and between Word Knowledge and Electronics Information 
hut only moderate correlations between Electronics Information and 
Coding Speed and between Mathematics Knowledge and Automo¬ 
bile/Shop Information. Thirty-six of the forty-five correlations were 
above .5. 

Psychometrics apprnaches a table of correlations with one or another 
of its methods for factor analysis. Factor analysis (or other mathemati- 
cal procedures that go under other names) extracts the factors 111 that ac- 
count for the observed pattern of subtest scores. The Basic idea is that 
scores on any pair of tests are correlated to the extent that the tests mea¬ 
sure something in common: If they test traits in common, they are cor¬ 
related, and if not, not. Factor analysis teils how many different 
underlying factors are necessary to account for the ohserved correlations 
between rhem. If, for example, the subtest scores were totally uncorre- 
lared, ir would take ten independent and equally significant factors, one 
for each subtest by itself. With each test drawing on its own unique fac¬ 
tor, the forty-five correlations would all be zeros. At the other extreme, 
if the subtests measured precisely the same thing down to the very small- 
est detail, then all the correlations among scores on the subtests could 
be explained by a single factor—that thing which all the subtests pre- 
cisely measured—and the correlations would all be ones. Neither ex¬ 
treme descrihes the actuality, hut for measures of intelleetual 
performance, one large factor comes closer than many small ones. This 
is not the place to dwell on mathematical details except to note that, 
contrary to claims in nontechnical works, 4 the conclusions we draw 
about general intelligence do not depend on the particular method of 
analysis usedS 

For the ASVAB, 64 percent of the variance among the ten subtest 
scores is accounted for by a single factor, g. A second factor accounts for 
another 13 percent. With three inferred factors, 82 percent of the vari¬ 
ance is accounted for. |6i The intercorrelations indicate that people do 
vary importantly in some single, underlying trait and that those varia- 
tions affect how they do on every test. Nor is the predominance of g a 
fortuitous result of the particular subtests in ASVAB. The air force’s ap¬ 
titude test for prospective officers, the AFOQT (Air Force Officer Qual- 
ifying Test) similarly has g as its major source of individual Variation. 7 
Indeed, all broad-gauged test batteries of cognitive ability have gas their 
major source of Variation among the scores people get.‘ s 
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The naive theory assumes that when scores on two subtests are cor- 
related, it is because of overlapping content. But it is impossible to make 
sense of the varying correlations between the subtests in terms of over- 
lapping content. Consider again the correlation between Arithmetic 
Reasoning and Mathematical Knowledge, which is the highest of all. k 
may seem to rest simply on a knowledge of mathematics and arithmetic. 
However, the score on Numerical Operations is less correlated with ei- 
ther of those two tests than the two are with each other. Content pro- 
vides no clue as to why. Arithmetic Reasoning has only word problems 
on it; Mathematical Knowledge applies the basic methods of algebra 
and geometry; and Numerical Operations is an arithmetic fest. Why are 
scores on algebra and geometry more similar to those on word problems 
than to those on arithmetic? Such variations in the correlations be¬ 
tween the subtests arise, in fact, less from common content than from 
how much they draw on the underlying ability we call g. The varying 
correlations between the subtests preclude explaining g away as, for ex- 
ample, simply a matter of test-taking ability or test-taking experience, 
which should affect all tests more or less equally. V/e try to make some 
of these ideas visible in the figure below. 


The relation of the ASVAB subtests to each other and to g 


Correlation with g 
1 . 0 - 


0.9- 

0 . 8 - 

0.7- 


0 . 6 - 


E 

/• 

MC 


MK 


ARGS 
WK 


PC 


NO 


AS 


Bold: Subtests used 
in the AFQT 


i ■ ' ' ' f J ■ 'i 

0-5 0.6 0.7 

Average correlation with the other subtests 


For each subtest on ASVAB, we averaged the nine correlations with 
each of the other subtests, and that average correlation defines the hor¬ 
izontal axis. The vertical axis is a measure, for each subtest, of the 
correlation between the score and g The two-letter Codes identify 
the subtests. At the top is General Science (GS), closely followed 
by Word Knowledge (WK), and Arithmetical Reasoning (AR), for 
which the scores are highly correlated with g and have the highest 
average correlations with all the subtests. Another three subtests— 
Mathematics Knowledge (MK), Paragraph Comprehension (PC), and 
Electronics Information (EI)—are just slighrly below the top clusfer 
in both respects. At the bottom are Coding Speed (CS), Auto¬ 
mobile/Shop Information (AS), Numerical Operations (NO), and 
Mechanical Comprehension (MC), subtests that correlate, on the 
average, the least with other subtests and are also the least correlated 
with g (although still substantiaüy correlated in their own right). The 
bottom group includes the two speeded subtests, CS and NO, thereby 
refuting another common misunderstanding about g, which is that 
it refers to mental speed and little more. Virtually without exception, 
the more dependent a subtest score is on g, the higher is its average 
correlation with the other subtests. This is the pattern that betrays 
what g means—a broad mental capacity that permeates pertorm- 
ance on anything that challenges people cognitively. A rough rille 
of thumb is that items or tests that require mental complexity draw 
more on g than items that do not—the difference, for example, 
bet ween simply repeating a string of numbers after hearing them once, 
which Joes not much test g, and repeating them in reverse order, which 
does. 10 

The four subtests used in the 1989 scoring version of the AFQT (the 
one used throughout the text) and theirgloadings are Word Knowledge 
(.87), Paragraph Comprehension (.81), Arithmetic Reasoning (.87), 
and Mathematics Knowledge (.82). |llJ The AFQT is thus one of the 
most highly g-loaded tests in use. By way of comparison, the factor load- 
ings for the eleven subtests of the Wechsler Adult Intelligence Scale 
(WAIS) ränge from .63 to .83, with a median of .69. 1121 Whereas the first 
factor, g, accounts for over 70 percent of the variance in the AFQT, it 
accounts for only 53 percent in the WAIS. 
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Correlations of the AFQT with Other IQ Tests 

Our second approach to the question, Is the AFQT an IQ test? is to ask 
how the AFQT correlates with other well-known standardized mental 
tests (see the table below). We can do so by making use of the hij>h 
school transcript survey conducted by the NLSY in 1979. In addition 
to gathering information about grades, the survey picked up any other 
IQ test that the Student had taken within the school system. The data 
usually included both the test score and the percentile rank, hased on 
national norms. In accordance with the recommendation of the NLSY 
User's Manual, we use percentiles throughout. 15 


Correlations of the AFQT with 
Other IQ Tests in The NLSY 


California Test of Mental Maturity 

Sample 

356 

Correlation 
with the 

AFQT 

.81 

Coop School and College Ability Test 

121 

.90 

Differential Atitude Test 

443 

.81 

Henmon Nelson Test of Mental Maturity 

152 

.71 

KuhlmanmAnderson Intelligence Test 

36 

.80 

Lorge'Thomdike Intelligence Test 

170 

.72 

Otis-Lennnon Mental Ability Test 

530 

.81 


The magnitudes of the correlations between the AFQT (using the 
age-referenced percentile scores) and classic IQ tests are as high as or 
higher than the observed correlations of the classic IQ tests with each 
other. For example, the best-known adult test, the WA IS, is known to 
correlate (using the median correlation with various studies, and not 
correcting for restriction of ränge in the samples) with the Stanford-Bi- 
net ar .77, with the Ravens Standard Progressive Matrices at .72, the 
SRA Non-verbal test at .81, the Peabody Picture Vocabulary Test at .83, 
and the Otis at .78. 14 The table below summarizes the intercorrelations 
of IQ tests, hased on the comparisons assembled by Arthur Jensen as of 
1980, and adding a line for the AFQT comparisons front the NLSY. The 
AFQT compares favorably with the other major IQ tests by this mea- 
sure, which in turn is consistent with the high g-loading of the AFQT. 


Correlations of the Major IQ Tests with 

Other Standardized Mental Tests 

AFQT (age-referenced, 1989 scoring) 

Median Correlation 
with Other 

Mental Tests 
.81 

Wechsler-Bellevue I 

.73 

Wechsler Adulr Intelligence Scale (WAIS) 

.77 

Wechsler Intelligence Scale for Chiidren 

.64 

Stanford-Binet 

.71 

So um 1 : Jensen 1980, Table 8.5, and au r Kor 's analysi: 

s of the NLSY. 


HOW SENSITIVE ARE THE RESULTS TO THE ASSUMPTION 
THAT IQ IS NORMALLY DISTRIBUTED? 

Any good test designed to measure a complex ahiliry (whether a test of 
cognitive ahility or carpentry ability) will have several characteristics 
that common sense says are desirable: a large numher of items, a wide 
ränge of difficulty among the items, no marked gaps in rhe difficulty of 
the items, a variety of types of items, and items that have some reim 
tionship to each other (i.e., are ro some degree measuring the same 
thing). 11 Empirically, tests with these characteristics, administered to a 
representative sample of those for whom the test is intended, will yield 
scores that are spread out in a fashion resemhiing a normal distribution, 
or a hell curve. In this sense, tests of mental ability are not designed to 
produce normal ly distributed scores; that s just what happens, the same 
way that height is normally distributed without anyone planning it. 

1t is also true, however, that tests are usually scored and standardized 
under the assumption that intelligence is normally distributed, and this 
has led to allegations that psychometricians have hamhoozled people 
into accepring that intelligence is normally distributed, when in fact it 
may just be an artifact of the way they choose to measure intelligence. 
For a response to such allegations, Chapter 4 of Arthur Jensen s Bios in 
Mental Testing (New York: Free Press, 1980) remains the best discussion 
we have seen. 

For purposes of assessing the analyses in this book, it may help read- 
ers to know the extent to which any assumptions about the distribution 
of AFQT scores might have affected the resulrs, especially since we 
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rescored the AFQT to correct for skew (see Appendix 2). The descrip- 
rive statisticsshowing the breakdown ofeach variable hy cognitive dass, 
presented in each ehapter of Part II, address that issue. Assignment to 
cognitive classes was based on thesubject’s rank within the distribution, 
and these ranks are invariant no matter what the normality of the distri- 
hution might be. Ranks were also unaffected by the correcrion tor skew. 

The descriptive statistics in the text were bivariate. To examine this 
issue in a multivariate framework, we replicated the analyses of Part II 
substituting a set of nominal variables, denoting the cognitive classes, 
for the continuous AFQT measure. That is, the regression rreated 
“memhership in Class I” as a nominal variable, just as it would treat 
“rnarried” or “Latino” as a nominal characteristic—and similarly for the 
other four cognitive classes, also enrered as nominal variables (See Ap- 
pendix 4 for a discussion of how to interpret the coefficients for nomi¬ 
nal variables as created by the Software used in these analyses, JMP.3.0). 
Be low, we show rhe results for the opening analysis of Part II (Chapter 
5), the probahility of being in poverty. 

Comparison of results when AFQT is treated as a continuous, 
normally distributed variable and when it is treated as a 
set of nominal categories based on groupings by centile 


Probahility of being in poverty 

3 o r /c - 



centile) centile) centile) centile) centile) 

No re: For compuring tlie plor, age and SES were ser ar their mean value.y 


The results of the logistic regression analysis using the normally 
distributed AFQT score follow: 


Source 

DF 

Whole-Model Test. 
-LogLikelihood ChiSquare 

Proh>ChiSq 

Model 

5 

477222.0 954443.9 

0.000000 

Error 

4488 

4587166.7 


C Total 

4491 

5064388.7 



RSquare (U) 0.0942 

Observations 4,492 

Paramcrer Esrimates 


Term 

Estimar.e 

Std Error 

ChiSquare 

Proh>ChiSq 

lntercept 

-2.6579692 

0.0009826 


0.0000 

zAFQT89 

-0.8177031 

0.0012228 

447179 

0.0000 

zSES 

-0.2744971 

0.0011661 

55416 

0.0000 

-Age 

-0.0482156 

0.0009187 

2754.1 

0.0000 


These are the results using rhe categorization into cognitive classes hy 


ccnttlc: 





Whole-Model Test 



Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

6 383494.7 

766989.4 

0.000000 

Error 

4485 4680894.0 



("Total 

4491 5064388.7 




RSquare (U) 0.0757 




Observations 4,492 




Parameter Esrimates 


Tenn 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

lntercept 

-2.5097718 0.0015823 


0.0000 

CogClas.[l'S] 

-1.0067168 0.0050693 

39439 

0.0000 

('ogC3tas.[2-5| 

-0.6803606 0.0025486 

71265 

0.0000 

CogClas.[3Al 

-0.1905042 0.0018498 

10606 

0.0000 

CogCbis.[4-5] 

0.64764109 0.0021336 

92138 

0.0000 

:SE8 

-0.3902981 0.0011276 

119800 

0,0000 

zAge 

-0.1605992 0.000907 

31350 

0.0000 


We repeared these comparisons for a hroad sampling of the outcome 
variables discussed in Part 11. The results for poverty were typical. When 
the results for the two expressions of IQ do not correspond (e.g., rhe re- 
lationship of mother’s IQ to low birth weight, as discussed in Chapter 
10), the lack of correspondence also showed up in the bivariate table 
showing the breakdown by cognitive class. Or to put it unother way, the 
results presented in the text using IQ as a continuous, normally distiib- 
uted variable are produced as well when IQ is treated as a set of cate- 
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gories. Any exceptions to that may be identified through the bivariate 
tables based on cognitive dass. 


RELATIONSHIP OF THE AFQT SCORE TO EDUCATION AND 
PARENTAL SES 

The relationship of an IQ test score to education and socioeconomic 
background is a constant and to some extent unresolvable source of con- 
troversy. It is known that the environment (including exposure to edu¬ 
cation) affects realized cognitive ability* To that extent, it is 
conceptually appropriate that parental SES and years of education show 
an independent causal effect on IQ. On the other hand, an IQ test score 
is supposed to represent cognitive ability and to have an independent 
reality of its own; in other words, it should not simply be a proxy mea- 
sure of either parental SES or years of education. The following discus- 
sion elaborates on the Statistical relationship of both parental SES and 
years of education to the AFQT score. 

The Socioeconomic Status Index and the AFQT Score. 

The SES index consists of four indicators as described in Appendix 2; 
mother s and father s years of education, the occupational Status of the 
parent with the higher-status job, and the parents' total family income 
in 1979—1980. The correlations of the index and its four constituent 
variables with the AFQT are in the table below. 


Intercorrelations of the AFQT and the Indicators in the 
Socioeconomic Status Index 


AFQT 

Mother’s education 

.43 

Father’s education 

.46 

Occupational Status 

.43 

Family income 

.38 

SES Index 

.55 


The correlation of AFQT with the SES index itself is .55, consistent 
with other investigations of this topic. 16 

There are three broad interpretations of these correlations: 


1. Test bias . IQ tests scores are artificially high for persons from high- 
status backgrounds because the tests are biased in favor of people 
from high-status homes. 

2. Environmental advantage . IQ tends to be genuinely higher for chil¬ 
dren from high-status homes, because they enjoy a more favorable 
environment for realizing their cognitive ability than do children 
from low-status homes. 

3. Genetic advantage . IQ tends to be genuinely higher for children 
from high-status homes because they enjoy a more favorable ge- 
netic background (parental SES is a proxy measure for parental 

IQ). 

The first explanation is discussed in Appendix 5. The other two ex- 
planations have been discussed at various points in the text (principally 
Chapter 4’s discussion of heritability, Chapter 10 s discussion of parent- 
ing styles, and Chapter 17’s discussion of adoption). To summarize those 
discussions, being brought up in a conspicuously high-status or low- 
status family from birth probably has a significant effect on IQ, in¬ 
dependent of the genetic endowment of the parents. The magnitude 
of this effect is uncertain. Studies of adoption suggest that the average 
is in the region of six IQ points, given the difference in the environ- 
ments provided by adopting and natural parents. Outside interventions 
to augment the environment have had only an inconsistent and un¬ 
certain effect, although it remains possible that larger effects might be 
possible for children from extremely deprived environments. In terms 
of the topic of this appendix, the flexibility of the AFQT score, the 
AFQT was given at ages 14-23, when the effect of socioeconomic back¬ 
ground on IQ had already played whatever independent role it might 
have. 

Years of Education and the AFQT Score 

For the AFQT as for other IQ tests, scores vary directly with educational 
attainment, leaving aside for the moment the magnitude of reciprocal 
cause and effect. But to what extent could we expect that, if we man- 
aged to keep low-scoring students in school for another year or two, their 
AFQT scores would have risen appreciably? 

Chapter 17 laid out the general answer from a large body of research: 
Systematic attempts to raise IQ through education (exemplified by 
the Venezuelan experiment and the analyses of SAT coaching) can 
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indeed have an effect on che order of .2 Standard deviation, or three IQ 
points. As far as anyone can teil, there are diminishing marginal bene- 
fits of this kind of coaching (taking three intensive SAT eoaching pro- 
grams in succession will raise a score by less than three times the original 
increment). 

We may explore the issue more directly by making use of the other 
IQ scores ohtained for memhers of the NLSY. Given scores that were 
obtained several years earlier than the AFQT score, ro what extern do 
the intervening years of education appear to have elevated the AFQT? 

Underlying the discussion is a simple model: 


Earlier IQ 
score 



■+» AFQT 


Years of 
education 


score 



The earlier IQ score affects both years of education and is a measure of 
the same thing that AFQT measures. Meanwhile, the years of educa¬ 
tion add something (we hypothesize) to the AFQT score that would not 
otherwise have been added. 

Actually testing the model means bringing in several complications, 
however. The elapsed time between the earlier IQ test and the AFQT 
tost presumably affects the relationships. So does the age of the suhject 
(a subject who took the test at age 22 had a much different “chance” to 
add years of education than did a subject who took the test at age 18, 
for example). The age at which the earlier IQ test was taken is also rel¬ 
evant, since IQ test scores are known to become more stable at around 
the age of 6. But the main point of the exercise may he illustrated 
straightforwardly. We will leave the elaboration to our colleagues. 

The database consists of all NLSY students who had an earlier IQ) test 
score, asreported in the tableon page 596, plus students with valid Stan- 
ford-Biner and WISC scores (too few to report separately). We report 
the results for two models in the table below, with the AFQT score as 
the dependent variable in both cases. In the first model, the explana- 
tory variables are the earlier IQ score, age at the first test, elapsed years 
between the two tests, and type of test (entered as a vector of dummy 
variables), ln the second model, we add years of education as an inde¬ 
pendent variable. An additional year of education is associated with a 
gain of 2.3 centiles per year, in line with other analyses of the effects of 


The Independent Effect of Education on AFQT Scores 
as Inferred from Earlier IQ Tests 

Dependent v ariable: AFQT per centile score _ 

Independent Variables Model 1 Model 2 

Coefficient Std. Error Coefficient Std. Error 


Inrercept 

12.303 

1.653 

-6.783 

2.443 

Earlier IQ 

percentile score 

.787 

.016 

.753 

.015 

Elapsed years 

between tests 

-.316 

.166 

-1.005 

.173 

Years of education 

— 

— 

2.280 

.221 


Type of tesr (entered 
as a vector of nominal 
variables, coefficients 
not shown.) 

No. of observations 1,408 1,408 

R : (Adjusted) *659 .681 


education on IQ. 17 What happens if the dependent variable is expressed 
in Standardized scores rather rhan percentiles? ln that case (using the 
same independent variables), the independent effect of education is to 
increase the AFQT score by .07 Standard deviarion, or the equivalent 
of about one IQ point per year—also in line with other analyses. 

We caution against interpreting these coefficients literally across the 
entire educational ränge. Whereas it may be reasonable to think about 
IQ gains for six additional years of education when comparing subjects 
who had no schooling versus those who reached sixth grade, or even 
comparing those who dropped out in sixth grade and those who re- 
mained through high school, interpreting these coefficients becomes 
problematic when moving into post-high school education. 

The negative coefficient for “elapsed years between tests” in the table 
above is worth mentioning. Suppose that the true independent rela- 
rionship hetween years of education and AFQT is negatively acceler- 
ated—that is, the causa! importance of the elementary grades in 
developing a person’s IQ is greater than the causa 1 role of, say, graduate 
school. lf so, then the more years of Separation between tests, the lower 
would be the rrue value of the dependent variable, AFQT, compared to 
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Appendix 4 

Regression Analyses from Part II 


This appendix presents the logistic regressions for the figures in Chap- 
ters 5 through 12. In the text, the figures are based on regressions that 
use the entire white sample in the NLSY and are calculated using sam¬ 
ple weights. We use the entire sample and weights to take advantage of 
the NLSY’s supplemental sample of low-income whites; in our judg- 
ment, doing so provides the best available estimates of the relationships 
we discuss. But interpreting Standard errors and Statistical significance 
is greatly complicated when using sample weights. In the regression re- 
sults that follow, we therefore restrict the analyses to the nationally rep- 
resentative cross-sectional sample of whites. This procedure not only 
enables direct interpretation of the Standard errors but also provides the 
raw material for interested readers to see how much difference there is 
between the results from the entire white sample and the cross-sectional 
sample (which you may do by computing the probabilities for the cross- 
sectional sample and comparing them to the ones shown in the text fig¬ 
ures). We have done so ourselves and can report that the differences are 
so small that they are seldom visually evident. 

By “whites,” we mean all NLSY subjects who were identified as “non- 
black, non-Hispanic” in the NLSY’s racial/ethnic cohort screening 
(variable R2147, in the NLSY’s documentation), deleting those who 
identified themselves as being of American Indian, Asian, or Pacific de- 
scent in the “first or only racial/ethnic origin” item (R96). 

In the text, we do not refer to the usual measure of goodness of fit 
for multiple regressions, R 2 , but they are presented here for the cross- 
sectional analyses. As the ratio of the explained sum of squares to the 
total sum of squares, R 2 is in this instance the square of the correlation 
between the set of independent variables and the dependent variable 
expressed as the logarithm of the odds ratio. Inasmuch as the values of 
R 2 ränge widely in the tables to follow, some mention of them is war- 
ranted. 
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The size of R 2 teils something about the srrength of the logistic re- 
lationship between the depenclent variable and the set of independent 
variables, but it also depends on the composition of the sample, as do 
correlation coefficients in general. Even an inherently strong relatioiv 
ship can result in low values of R 2 if the data points are hunched in 
various ways, and relatively noisy relationships can result in high vah 
ues if ehe sample includes disproportionate numbers of outliers. For ex¬ 
ample, one of the smallest R 2 in the following analyses, only 0.17, is 
for white men out of the labor force for four weeks or more in 1989. 
Apart from the distributional properties of the data that produce this 
low R\ a rough common-sense meaning to keep in mind is that the 
vast majority of NLSY white men were in the labor force even though 
they had low IQs or deprived socioeconomic backgrounds. But the pa- 
rameter for zAFQT in that same equation is significant beyond the .001 
level and large enough to make a big difference in the probability that 
a white male would be out of the labor force. This illustrates why we 
therefore consider the regression coefficients themselves (and their as- 
sociated p values) to suit our analytic purposes better than R% and that 
is why those are the ones we relied on in the text. 

The Standard independent variables, described in Appendix 2, are 
zAFQT89 f the 1989 scoring of the AFQT; zSES, the socioeconomic 
Background of the NLSY suhjects; and zAge, based on the age of the 
NLSY subjects as of December 31, 1990. All are expressed as Standard 
scores with a mean of 0 and a Standard deviation of 1. 

All dependent variables are binary. The coefficients are parameter 
estimates when the dependent variable = “yes.” The linear logistic 
model has the form 

logit(p) - log(/>/(l-p)) = a + ß'x 

where a is the intercept parameter and ß is the vector of slope Para¬ 
meters for a vector of independent variables x. Take as an example the 
first set of results presented subsequently, involving poverty. Sup- 
pose you want to know the probability that a person is under the poverty 
line in 1989 (Poverty = “Yes”), stipulating that the person in question 
has an IQ (zAFQT) 1.5 Standard deviations below the mean, socioeco¬ 
nomic background (zSES) .3 Standard deviation above the mean, and 
is exactly of mean age. Using the parameters in the basic analysis for 
poverty rounded to four decimal places, and a computationally conve- 
nient re-expression of p, the probability is computed as follows: 


|ogit(p) = -2.6487 + (-.8376 x-1.5) + (-.3301 x. 3) + (-.0238 x 0) = -L4913 

p = e 1491 V( 1 + e 14vn ) 
p = .1837 

The probability we set out to compute is 18.37 percent. 

“The High School Sample” consists of those who received a high 
school diploma rhrough the normal route (not a GED) and reported ex¬ 
actly twelve years of education as of the 1990 interview. 

“The College Sample” consists of those who completed a bachelors 
degree and reported exactly sixteen years of education as of the 1990 in¬ 
terview. 

The Software used for the analyses is JMP Version 3, by SAS Insti¬ 
tute Inc. JMP treats nominal independent variables differently from 
other major Software packages such as SAS and SPSS. In those pack- 
ages, a parameter for a nominal variable represents the difference be¬ 
tween rhat level of the nominal variable and an omitted level serving 
as a reference group. ln JMP, a parameter represents the difference of a 
given level from the average over all levels of the nominal variable. The 
implied parameter for the remaining level is the negative sum of the 
other levels (i.e., the parameters sum to zero over all the effect levels). 
For example, suppose Race were being used as a nominal variable, with 
categories of Black, Latino, and White. In the JMP printout, the coef¬ 
ficients would appear as 

Race[Black-White] x, 

Race[Latino-White] x, 

The Order is determined hy the alphabetical order of the categories, 
ln this case, the coefficient x, applies to blacks, x, to Latinos. The 
implied White coefficient is -l*(x 1 + x 2 ). ln the case of a binary 
independent variable such as Sex, the printout would show a single 
line 

Sex[Female-Male] x, 

which applies to females. The coefficient for Male equals -x ( . 

CHAPTER 5: POVERTY 

PEPENDENT VARIABLE: Under the official poverty line in 1989. 

SAMPLE RESTRIOTlONS: Excludes those who reported they were out of 
the labor force because they were in school in either the 1989 or 1990 
Interviews. 
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Basic Analysis: 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

90.94009 

181.8802 

0.000000 

Error 

3363 

784.40179 



C Total 

3366 

875.34188 




RSquare 

(U) 

0.1039 



Observations 

3367 




Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-2.6487288 

0.0768803 

1187 

0.0000 

zAFQT89 

-0.8376338 

0.0935061 

80.25 

0.0000 

zSES 

-0.3300720 

0.0900996 

13.42 

0.0002 

zAge 

-0.0238375 

0.0723735 

0.11 

0.7419 

The High School Sample: 






Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

22.01811 

44.03622 

0.000000 

Error 

1232 

325.26939 



C Total 

1235 

347.28750 




RSquare 

(U) 

0.0634 



Observations 

1236 




Parameter Estimate; 



Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-2.7237775 

0.1290286 

445.63 

0.0000 

zAFQT89 

-0.8267293 

0.1627358 

25.81 

0.0000 

zSES 

-0.3619703 

0.1499855 

5.82 

0.0158 

zAge 

+0.1049227 

0.1094603 

0.92 

0.3 378 

The College Sample: Omitted. Only six persons in the cross^sectional 


College Sample were in poverty. 

For Mothers M arried as of the 1989 Interview: 

Whole-Model Test 

Source DF -LogLikelihood ChiSquare Proh>ChiSq 

Model 3 17.14553 34.29106 0.000000 

Error 786 179.84999 

C Total 789 196.99552 

RSquare (U) 

Observations 


0.0870 

790 
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Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.7732817 

0.1646023 

283.87 

0.0000 

zAFQT89 

-0.6437797 

0.2140132 

9.05 

0.0026 

zSES 

-0.3910629 

0.2020317 

3.75 

0.0529 

zAge 

-0.3338674 

0.1587605 

4.42 

0.0355 


For Mothers Who Were Separated, Divorced, or Never M arried as of the 
1989 Interview: 

Whole-Model Test 

Source DF -LogLikelihood ChiSquare Prob>ChiSq 

Model 3 8.07114 16.14228 0.001060 

Error 21 1 1 35.77658 

C Total 214 143.84772 

RSquare (U) 0.0561 

Observations 215 

Parameter Estimates 

Term Estimate Std Error ChiSquare Prob>ChiSq 

Intercept -0.7449132 0.1713794 18.89 0.0000 

zAFQT89 -0.6722121 0.2277019 8.72 0.0032 

zSES -0.1597461 0.1952709 0.67 0.4133 

zAge -0.1524315 0.1530986 0.99 0.3194 

CHAPTER 6: SCHOOLING 

DEFENDENT VARIABLE: Permanently dropped out of high school. 

SAMPLE RESTRltmONS: Excludes those who obtained a GED. 

Basic Analysis: 

Whole-Model Test 

Source DF -LogLikelihood ChiSquare Proh>ChiSq 

Model 3 393.8978 787.7956 0.000000 

Error 3568 779.9904 

C Total 3571 1 173.8882 

RSquare (U) 0,3355 

Observations 3572 

Parameter Estimates 

Term Estimate Std Error ChiSquare Proh>ChiSq 

Intercept -2.85322606 0.0939659 922.00 0.0000 

zAFQT89 -1.72295934 0.1028145 280.83 0.0000 

zSES -0.64776232 0.0896658 52.19 0.0000 

zAge +0.05695640 0.0688286 0.68 0.4079 
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DEPENDENT VARIABLE: Received a GED instead of a high school diploma. 
SAMPLE RESTRICTIONS: Excludes those who obtained neither a high 
school diploma nor a GED* 


Basic Analysis: 






Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

72.06475 

144.1295 

0.000000 

Error 

3490 

915.28145 



C Total 

3493 

987.34620 




RSquare 

(U) 

0.0730 



Observations 

3494 




Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.3548461 

0.0653867 

1297 

0.0000 

zAFQT89 

-0.4325254 

0.0851185 

25.82 

0.0000 

zSES 

-0.6082151 

0.0837515 

52.74 

0.0000 

zAge 

-0.0416441 

0.0662445 

0.40 

0.5296 


DEPENDENT VARIABLE: Received a bachelor s degree* 

SAMPLE RESTRICTIONS: Excludes those who had less than a bachelor’s de^ 
gree and were in postsecondary education in either the 1989 or 1990 
interview* 


Basic Analysis: 





Whole-Model Test 

Source 

DF 

-LogLikelihood 

ChiSquar 

Model 

3 

807.9072 

1615.814 

Error 

3817 

1364.3417 


C Total 

3820 

2172.2489 



RSquare (U) 

0.3719 


Observ 

ations 

3821 


Prob>ChiSq 

0*000000 
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Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.41992250 

0.0786991 

945.50 

0.0000 

zAFQT89 

+ 1.80771403 

0.0795537 

516.34 

0.0000 

zSES 

+1.04818417 

0.0690372 

230.52 

0.0000 

zAge 

-0.29777760 

0.0516373 

33.25 

0.0000 


CHAPTER 7: UNEMPLOYMENT, IDLENESS, AND INJURY 

DEPENDENT VARIABLE: Out of the labor force for four weeks or more in 
1989* 

SAMPLE RESTRICTIONS: Civilian males who did not respond “unable to 
work” or “in school” to the question on labor force participation in 
the 1989 or 1990 interview* 


Basic Analysis: 

Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 9.44293 

18.88586 

0.000289 

Error 

1682 548.25144 



C Total 

1685 557.69437 




RSquare (U) 

0.0169 



Observations 

1686 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.20264085 0.0868001 

643.94 

0.0000 

zAFQT89 

-0.36246881 0.0992802 

13.33 

0.0003 

zSES 

+0.21788340 0.1075722 

4.10 

0.0428 

zAge 

-0.12815393 0.0864018 

2.20 

0.1380 

The High School Sample: 




Whole-Model Test 



Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 4.45831 

8.916625 

0.030420 

Error 

617 156.98046 



C Total 

620 161.43878 




RSquare (U) 

0.0276 



Observations 

621 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.69780012 0.1767563 

232.95 

0.0000 

zAFQT89 

-0.42151253 0.2264362 

3.47 

0.0627 

zSES 

+0.56489480 0.230053 

6.03 

0.0141 

zAge 

-0.14556950 0.1672623 

0.76 

0.3841 
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The College Sample: 




Whole-Model Test 



Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 6.794337 

13.58867 

0.003522 

Error 

264 56.536860 



C Total 

267 63.331196 




RSquare (U) 

0.1073 



Observations 

268 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

lntercept 

-3.12957075 0.6081769 

26.48 

0.0000 

2AFQT89 

-0.84324247 0.4526768 

3.47 

0.0625 

zSES 

+0.94514750 0.3875388 

5.95 

0.0147 

zAge 

-0.46061574 0.299044 

2.37 

0.1235 


DEFENDENT VARIABLE: Unempioyed for four weeks or niore in 1989. 
SAMPLE RESTR1CTIONS: Civilian males who did not respond “unahle to 
work” or “in school” to the question on labor force participation in 
the 1989 or 1990 interview and were in the labor force throughout 
1989. 

Basic Analysis; 




Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

11.30841 

22.61682 

0.000049 

Error 

1393 

348.7151 1 



C Total 

1396 

360.02353 




RSquare 

(U) 

0.0314 



Observations 

1397 




Parameter Estimates 


Term 

Esrimate 

Std Error 

ChiSquare 

Proh>ChiSq 

lntercept 

-2.53577016 0.1076083 

555.30 

0.0000 

zAFQT89 

-0.49486463 0.1298967 

14.51 

0.0001 

zSES 

-0.02534849 0.1383889 

0.03 

0.8547 

zAge 

-0.02181428 0.1108396 

0.04 

0.8440 

The High 

School Sample: 






Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

1.86533 

3.730657 

0.292056 

Error 

533 

140.49123 



C Total 

536 

142.35656 




RSquare 

(U) 

0.0131 



Observations 

537 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

lntercept 

-2.59878187 

0.1766146 

216.51 

0.0000 

zAFQT89 

-0.39353140 

0.2368752 

2.76 

0.0966 

zSES 

+0.13951940 

0.2353179 

0.35 

0.5532 

zAge 

-0.10510566 

0.1762471 

0.36 

0.5509 

The College 

Sample: 






Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

3.570096 

7.140193 

0.067561 

Error 

224 

40.506133 



C Total 

227 

44.076230 




RSquare (U) 

0.0810 



Observations 

228 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

lntercept 

-3.1686886 

0.7276735 

18.96 

0.0000 

:AFQT89 

-0.9196886 

0.5641635 

2.66 

0.1031 

zSES 

+ 1.0039255 

0.5015717 

4.01 

0.0453 

zAge 

+0.2941965 

0.3311174 

0.79 

0.3743 


CHAPTER 8: FAMILY MATTERS 


DEFENDENT VARIABLE: Ever married befere the age of 30. 

SAMPLE RESTRICTIONS: Persons who turned thirty by the 1990 interview. 


Basic Analysis 


Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

6.43345 

12.8669 

0.004933 

Error 

1630 

839.76747 



C Total 

1633 

846.20092 




RSquare 

(U) 

0.0076 



Observations 

1634 




Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

lntercept 

+1.19841361 0.128902 

86.44 

0.0000 

zAFQT89 

-0.0473587 

0.0757854 

0.39 

0.5320 

zSES 

-0.1905526 

0.0786307 

5.87 

0.0154 

zAge 

+0.20403379 0.1290545 

2.50 

0.1139 
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The High School Sample: 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

6.92871 

13.857 

0.003106 

Error 

601 

259.40296 



C Total 

604 

266.33168 




RSquare (U) 

0.0260 



Observations 

605 



Parameter Estimates 


Term 

Estimate 

Std Error 

C duSquare 

Proh>ChiSq 

lntercept 

+ 1.41494853 

0.2342703 

36.48 

0.0000 

zAFQT89 

+0.51424443 

0.1598383 

10.35 

0.0013 

zSES 

-0.1128845 

0.1582799 

.51 

0.4757 

zAge 

+0.36827169 

0.2422543 

2.31 

0.1285 

The College Sample: 






Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

0.17181 

0.343616 

0.951627 

Emir 

233 

145.35748 



C Total 

236 

145.52929 




RSquare (U) 

0.0012 



Ohservations 

237 



i 

’arameter Estimates 



Term 

Estimate 

Std Error 

ChiSquare 

Proh>Chi8q 

lntercept 

+0.71372375 

0.3946174 

3.27 

0.0705 

zAFQT89 

+0.05013859 

0.2237528 

0.05 

0.8227 

zSE S 

+0.0968295 

0.1833680 

0.28 

0.5975 

zAge 

-0.0177807 

0.2950863 

0.00 

0.9520 


zSES 

+0.22195410 

0.0787612 

7.94 

0.0048 

zAge 

-0.17766944 

0.0741478 

5.74 

0.0166 

MarDatel 

-0.08677335 

0.0243113 

12.74 

0.0004 

The High School Sample , Adding Date of First M arriage (MarDatel): 


Whole-Model Test 



Source 

DF 

—LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

4 

3.54304 

7.086073 

0.131409 

Error 

870 

428.70643 



C Total 

874 

432.24947 




RSquare (U) 

0.0082 



Observations 

875 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

lntercept 

+ 5.4451395 

3.1286887 

3.03 

0.0818 

ZÄFQT89 

-0.0379171 

0.1348129 

0.08 

0.7785 

zSES 

+0.2206925 

0.1288222 

2.93 

0.0867 

zAge 

-0.1078057 

0.1146773 

0.88 

0.3472 

MarDatel 

-0.0839950 

0.0383236 

4.80 

0.0284 


The College Sample , Adding Date of First Mau tage (Mar Datei ): 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

4 

5.548154 

11.09631 

0.025503 

Error 

204 

48.414468 



C Total 

208 

53.962623 




RSquare 

(U) 

0.1028 



LObservations 

209 



Parameter Estimates 


DH PENDENT VARIABLE: Divorced within the first five years of marriage. 
SAMPLE RESTRICTIONS: Persons married prior to January 1, 1986. 


Basic Analysis, Adding Date of First Maniage (Marl)atel): 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

4 

21.8881 

43.77626 

0.000000 

Error 

2026 

991.3719 



C Total 

2030 

1013.2600 




RSquare (U) 

0.0216 



Observations 

2031 



Parameter Estimates 


Term 

Estimate 

Srd Error 

ChiSquare 

Proh>ChiSq 

lntercept 

+ 5.70860970 

1.9858067 

8.26 

0.0040 

ZÄFQT89 

-0.35734009 

0.0781258 

20.92 

0.0000 


Term 

Esrimare 

Std Error 

ChiSquare 

Prob>ChiSq 

lntercept 

+ 32.392875 

13.508886 

5.75 

0.0165 

zAFQT89 

-0.75619367 

0.4502182 

2.82 

0.0930 

zSES 

-0.07354619 

0.3588816 

0.04 

0.8376 

zAge 

-0.55875424 

0.4046911 

1.91 

0.1674 

MarDate l 

-0.41113710 

0.1629791 

6.36 

0.0116 


Basic analysis, Adding Parental Living Arrangements at Age 14 (Adult 14) 


Source 
Model 
Error 
C Total 


Whole-Model Test 

DF -LogLikelihood ChiSquare 

6 16.7457 33.49136 

2022 994.6771 

2028 1011.4228 


RSquare (U) 
Observations 


0.0166 

2029 


Pmb>ChiSq 

0.000008 
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Parameter Estimates 


Term 

Estimare 

Std Error 

ChiSquare 

Prob>ChiSq 

lntercepr 

-1.2952650 

0.1066175 

147.59 

0.0000 

zAFQT89 

-0.3925580 

0.0774209 

25.71 

0.0000 

zSES 

+0.1910425 

0.0783345 

5.95 

0.0147 

zAge 

Adultl4 

-0.0278086 

0.0617722 

0.20 

0.6526 

[2 Bio-UnmarMom] 
Adult 14 

-0.1472662 

0.117616 

1.57 

0.2105 

[Bio/Step-UnmarMom] 

Adultl4 

+0.0621918 

0.1601525 

0.15 

0.6978 

[Other-UnmarMom] 

-0.0872024 

0.2553089 

0.12 

0.7327 


DEPENDENT VARIABLE: First birth out of wedlock. 
SAMPLE RESTRICTIONS: Women with at least one child. 


Basic Analysis: 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

39.86862 

79.73723 

0.000000 

Error 

1217 

461.90618 



C Total 

1220 

501.77480 




RSquare 

(U) 

0.0795 



Observations 

1221 




Parameter Estimates 


Tenn 

Estimate 

Std Error C 

IhiSquare 

Proh>ChiSq 

Intercept 

-1.9432320 

0.0938185 

429.01 

0.0000 

zAFQT89 

-0.6537960 

0.1239489 

27.82 

0.0000 

zSES 

-0.3052597 

0.1189878 

6.58 

0.0103 

zAge 

-0.2405246 

0.0902516 

7.10 

0.0077 

The High School Sample: 






Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

15.09449 

30.18898 

0.000001 

Error 

512 

187.89956 



C Total 

515 

202.99405 




RSquare 

(U) 

0.0744 



Observations 516 

Parameter Estimates 

Term Esrimate Std Error ChiSquare Prob>ChiSq 

Intercept -2.1890354 0.1658602 174.19 0.0000 

zAFQT89 -0.7846895 0.2142378 13.42 0.0002 

zSES -0.2428727 0.2165927 1.26 0.2621 

zAge -0.4145066 0.1447961 8.20 0.0042 
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The College Sample: 

Whole-Model Test 

Source HF -LogLikelihood ChiSquare Prob>ChiSq 

Model 3 1.1299340 2.259868 0.520253 

Error 1 12 4-6193334 

C Total 115 5.7492674 

RSquare (U) 0.1965 

übservations 116 

Parameter Estimates 

Term Esrimate Std Error ChiSquare Prob>ChiSq 

Intercept -6.37685240 3.9705049 2.58 0.1083 

’AFQTH9 -0.31644570 1.9844225 0.03 0.8733 

zSES -0.72608390 1.5248314 0.23 0.6340 

zAge +2.58214793 2.8423709 0.83 0.3636 

Basic anabysis, A dding Living Arrangements i vith Adults at A ge 14 


(Adult 14): 

Whole-Model Test 


Source 

DF -LogLikelihood (. 

TiiSquare 

Prob>ChiSq 

Model 

6 46.62389 93.24777 

0.000000 

Error 

1214 455.15091 



C Total 

1220 501.77480 




RSquare (U) 1 

0.0929 



Übservations 

1221 



Parameter Estimates 



Term 

Estimare Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-1.8260275 0.1541482 

140.33 

0.0000 

zAFQT89 

-0.6620720 0.1259903 

27.61 

0.0000 

zSES 

-0.2460336 0.1221771 

4.06 

0.0440 

zAge 

-0.2109268 0.0909302 

5.38 

0.0204 

Adult 14 




[Z BiO'UnmarM 

[om] -0.2816545 0.1634249 

2.97 

0.0848 

AdultH 




[Bio/Step-UnmarMom] +0,2928507 0.206926 

2.00 

0.1570 

Adult 14 




[Other-UnmarMom] -0.4991684 0.3593261 

1.93 

0.1648 

Basic aruxlysis, 

Adding Presence of Biological Parents at Age 14 (l 4Bio): 


Whole-Model Test 



Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

6 47.56391 

95.12783 

0.000000 

Error 

1214 454.21088 



C Total 

1220 501.77480 




RSquare (U) 

0.0948 



Observations 

1221 
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Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.0199123 

0.2037839 

98.25 

0.0000 

zAFQT89 

-0.6567746 

0.1250691 

27.58 

0.0000 

zSES 

-0.2479794 

0.1214895 

4.17 

0.0412 

zAge 

14Bio 

-0.2037178 

0.0910296 

5.01 

0.0252 

[MomOnly-PopOnly] 

14Bio 

+0.6528652 

0.2233927 

8.54 

0.0055 

[Mom/Pop'PopOnly] 

-0.0862208 

0.21023 35 

0.17 

0.6817 

14 Bio [N e i the r- PopOn ly] 

-0.2371231 

0.4150982 

0.33 

0.5678 


Basic Analysis, Adding Poverty Status in the Calendar Year Prior io Birth 
(PreBirthPov): 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

4 

63.21118 

126.4224 

0.000000 

Error 

956 

292,73717 



C Total 

960 

355.94835 




RSquare (U) 

0.1776 



Observations 

961 



Parameter Estimate; 



Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-1.6785743 

0.1460018 

132.18 

0.0000 

zAFQT89 

-0.6300049 

0.1665952 

14.30 

0.0002 

zSES 

-0.1828877 

0.1513393 

1.46 

0.2269 

zAge 

-0.4759393 

0.127232 

13.99 

0.0002 

PreRirthPov 





[No-Yes] 

-0.8178684 

0.1266496 

41.70 

0.0000 

Basic Analyse 

s, Restricted to 

1 Women Below the Poierty Line 

in the C alen 

dar Year Prior 

to Birth: 





Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

3.005867 

6.011735 

0.111041 

Error 

95 

65.003329 



C Total 

98 

68.009196 




RSquare (U) 

0.0442 



Observations 

99 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-0.65306390 

0.2901964 

5.06 

0.0244 

zAFQT89 

-0.76887410 

0.345.3889 

4.96 

0.0260 

zSES 

+0.17993445 

0.2589166 

0.48 

0.4871 

zAge 

-0.13622880 

0.2289764 

0.35 

0.5519 


CHAPTER 9: WELFARE DEPENDENCY 

DEFENDENT VARIABLE: On welfare hy the first calendar year after the birth 
of the child. 

SAMPLE RESTRLCTIONS: Women with at least orte child bornprior to Jan- 
nary 1, 1989. 


Basic Analysis, Adding Poverty Status in the Year Prior to Birth (PreBirth¬ 
Pov) and Marital Status at the Time of the Birth (BStatus): 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

5 

100.37993 

200.7599 

0.000000 

Error 

833 

221.75844 



C Toral 

838 

322.13837 




RSquare (U) 

0.3116 



Observations 

839 



Parameter Estimates 


Term 

Esrimate 

Srd Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-1.03594055 

0.1713324 

36.56 

0.0000 

zAFQT89 

-0.57972844 

0.1892548 

9.38 

0.0022 

zSES 

-0.06130137 

0.1746782 

0.12 

0.7256 

zA^e 

PreBirthPov 

-0.11269946 

0.1457313 

0.60 

0.4393 

[No'Yesl 

BStatus 

-0.89960808 

0.1446041 

38.70 

0.0000 

|lllegit-Legit] 

+ 1.05258560 

0.1352006 

60.61 

0.0000 

The High School Sample , Adding Poverty Status in the Year 

Prior tu Birth 

(PreBirthPov 

') and Marital Status at the Time 

of the Birth (BStatus): 


Whole-Model Test 


Souree 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

5 

29.28354 

58.56707 

0.000000 

Error 

384 

108.14153 



C Toral 

389 

137.42507 




RSquare (U) 

0.2131 



Observations 

390 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-1.442341 10 

0.2659616 

29.41 

0.0000 

zAFQT89 

-0.607 35910 

0.3004261 

4.09 

0.0432 

zSES 

+0.12094082 

0.3096641 

0.15 

0.6961 

zAge 

-0.24139690 

0.2089849 

1.33 

0.2481 
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PrcBirthPov 


[NcvYcs] 

BStatus 

-0.67898980 

0.223275 

9.25 

0.0024 

[Illegit-Legit] 

+0.80812194 

0.205803.3 

15.42 

0.0001 


The College Sample: Omitted. Included no women who had received 
Aid to Families with Dependent Children (AFDC) within a year after 
the birth. 

DEPENDENT VARIABLE: On welfare for at least five years versus women 
with no welfare experience at all. 

SAMPLE RESTRICTIONS: Women with at least one child born prior to Jan- 
uary 1, 1986. For women scored as “no welfare,” child bom after De- 
cember.31, 1977 and complete data on welfare receipt from 1978 to 
1986. 


Basic Analysis, Adding Poverty Status in the Year Prior to Birth 
(PreBirthPov) and Marital Status at the Time of the Birth (BStatus): 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

5 

44.82635 

89.65269 

0.000000 

Error 

493 

96.90156 



C Total 

498 

141.72790 




RSquare 

(U) 

0.3163 



Observations 

499 




Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-1.5840878 

0.2826002 

31.42 

0.0000 

zAFQT89 

-0.5506878 

0.2950687 

3.48 

0.0620 

zSES 

-0.4921959 

0.2779368 

3.14 

0.0766 

zAge 

-0.1094338 

0.2276355 

0.23 

0.6307 

PreRirrhPov 





[No-Yes] 

-0.7636358 

0.2336359 

10.68 

0.0011 

BStatus 





[Illegir-LegitJ 

+ 1.1951879 

0.205013 

33.99 

0.0000 

The High School Sample , Adding Poverty Statu: 

; in the Year Prior to Birth 

(PreBirthPov) and Marita/ Status at the Time of the Birth (BStatus): 



Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Pruh>ChiSq 

Model 

5 

13.898589 

27.79718 

0.000040 

Error 

251 

48.695997 



C Total 

256 

62.594585 




RSquare 

(U) 

0.2220 



Observations 

257 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-1.7786656 

0.3901684 

20.78 

0.0000 

ZÄFQT89 

-0.2301309 

0.4429317 

0.27 

0.6034 

zSES 

-0.3131157 

0.4832739 

0.42 

0.5170 

zAge —0.0377430 

PreBirthPov 

0.3173131 

0.01 

0.9053 

[No-Yes] -0.6891978 

BStatus 

0.3355851 

4.22 

0.0400 

[1 lieg in Legi t] +1.1068557 

0.307595 

12.95 

0.0003 


The College Sample: Omitted. The cross-sectional College Sample included 
no women who were on chronic welfare within a year after the birth. 


CHAPTER 10: PARENTING 

HE PENDENT VARIABLE: Did the mother smoke during pregnancy? 
SAMPLE RESTRICTIONS: None. 


Basic Analysis: 

Source 
Model 
Error 
C Total 


Whole-Model Test 
DF -LogLikelihood ChiSquare 

3 84.6762 169.3523 

2338 1443.8251 

2341 1528.5013 

RSquare (U) 0.0554 

Observations 2342 

Parameter Estimates 


Term 

Intercept 

zAFQT89 

zSES 

zAge 


Estimate 

-0.65729780 

-0.63479220 

-0.13376440 

+0.09727632 


Std Error 

0.0465003 

0.0645408 

0.0604787 

0.0484283 


ChiSquare 

199.81 

96.74 

4.89 

4.03 


Proh>ChiSq 

0.000000 


Prob>ChiSq 

0.0000 

0.0000 

0.0270 

0.0446 


DEPENDENT VARIABLE: Low birth weight (weight less than 5.5 pounds at 
birth). 

SAMPLE RESTRICTIONS: Excludes premature babies whose weight was less 
than 5.5 Ibs. but was appropriate for gestational age. 


Basic Analysis: 


Source 
Model 
Error 
C Total 


Whole-Model Test 


DF -LogLikelihood 

3 6.55199 

2273 349.79375 

2276 356.34574 

RSquare (U) 

Observations 


ChiSquare 

13.10397 


0.0184 

2277 


Prob>ChiSq 

0.004417 
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Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-140600010 

0.1270004 

719.25 

0.0000 

zAFQT89 

-0.44308170 

0.1496847 

8.76 

0.0031 

zSES 

+0.03312669 

0.1492929 

0.05 

0.8244 

zAge 

+0.26896236 

0.1226929 

4.81 

0.0284 


Basic Analysis, Adding Poverty Status in the Year Prior to Birth (PreBirtC 
Pov): 

Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

4 9.09299 

18.18599 

0.001135 

Error 

1859 298.98002 



C Total 

1863 308.07301 




RSquare (U) 

0.0295 



Ohservations 

1864 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-3.12509860 0.16455 

360.69 

0.0000 

zAFQT89 

-0.455838OO 0.1674174 

7.41 

0.0065 

zSES 

+0.02995737 0.1628609 

0.03 

0.8541 

zAge 

+0.34292817 0.1.342861 

6.52 

0.0107 

PreBirthPov 




[No-Yesl 

-0.28644950 0.15725 

3.32 

0.0685 


Basic AnaFysis, Adding Mother's Age at Birth (AgeBirth): 

Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

4 6.77955 

13.55909 

0.008844 

Error 

2272 349.56619 



C Total 

2276 356.34574 




RSquare (U) 

0.0190 



Ohservations 

2277 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-3.90965790 0.7617514 

26.34 

0.0000 

zAFQT89 

-0.46251520 0.1522804 

9.22 

0.0024 

zSES 

+0.01584480 0.1517047 

0.01 

0.9168 

zAge 

+0.25360789 0.1252257 

4.10 

0.0428 

AgeBirth 

+0.02095854 0.0311104 

0.45 

0.5005 
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The High School Sample: 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

1.96999 

8.98998 

0.268019 

Error 

944 

179.09080 



C Total 

947 

181.06079 




RSquare 

(U) 

0.0109 



Ohservations 

948 




Parameter Estimates 



Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-3.1278597 

0.1778908 

809.16 

0.0000 

zAFCT89 

-0.3560319 

0.2387034 

2.22 

0.1358 

zSES 

+0.0653651 

0.2379847 

0.08 

0.7836 

zAge 

+0.2490558 

0.1681270 

2.19 

0.1385 


The College Sample: Omitted. The cross' sec tional College Sample 
included only four low-TirtFoweight bahies. 

DEFENDENT VARIABLE: The child’s mother was under the poverty line 
throughout the child’s first three years ot life. 

SAMPLE RESTRicmONS: Includes children born from January 1, 1978 
through December 31, 1987, with complete data on poverty for the 
first three years of the child’s life, beginning with the calendar year 
of hirth. Comparison group consists of children of mothers who were 
not in poverty in any of those years. 

Basic Analysis: 

Whole-Model Test 

Source HF -LogLikelihood ChiSquare Prob>ChiSq 

Model 3 79.84242 159.6848 0.000000 

Error 1054 246.63029 

C Total 1057 326.47271 

RSquare (U) 0.2446 

Ohservations 1058 

Parameter Estimates 

Term Estimate Std Error ChiSquare Proh>ChiSq 

Intercept -2.9319316 0.1679177 304.87 0.0000 

zAFQT89 -1.1608860 0.1893877 37.57 0.0000 

zSES -1.0386253 0.1734586 35.85 0.0000 

zAge -0.1837537 0.1320334 1.94 0.1640 
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Basic Analysis, Adding Poverty Status in the Year Prior to Birth (PreBirthPov): 




Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

4 

133.38437 

266.7687 

0.000000 

Error 

967 

161.88379 



C Total 

971 

295.26816 




RSquare (U) 

0.4517 



Observations 

972 




Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

[ntercept 

-1.9685017 

0.2117444 

86.43 

0.0000 

zAFQT89 

-1.0772447 

0.2375948 

20.56 

0.0000 

zSES 

-0.8977385 

0.2215879 

16.41 

0.0001 

zAge 

PreBirthPov 

+0.0117316 

0.1681889 

0.00 

0.9444 

[No-Yes] 

-1.7345986 

0.1635206 

112.53 

0.0000 


The High School Sample, Adding Poverty Status in the Year Prior to Birth 
(PreBirthPov): 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

4 

1 33.38437 

266.7687 

0.000000 

Error 

967 

161.88379 



C Total 

971 

295.26816 




RSquare (U) 

0.4517 



Observations 

972 




Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

1 ntercept 

-1.9685017 

0.2117444 

86.43 

0.0000 

zAFQT89 

-1.0772447 

0.2375948 

20.56 

0.0000 

zSES 

-0.8977385 

0.2215879 

16.41 

0.0001 

2 Age 

+0.0117316 

0.1681889 

0.00 

0.9444 

PreBirthPov 





[No-Yes] 

-1.7345986 

0.1635206 

112.53 

0.0000 


The College Sample: Omitted. The cross-sectional College Sample 
included only one child whose mother was beneath the poverty line 
rhroughout the first three years. 

OHPENDENT VARIABLE: The child s HOME index score was in the hottom 
decile. 

SAMPLE RESTRKTIONS: None. 

Additional control variables: Test year (TestYr, nominal: 1986, 1988, or 
1990) and the child’s age category for scoring the HOME index 
(HomeAgeCat, nominal, in years: 0/2, 3/5, 6+). 


Basic Analysis: 

Source 
Model 
Error 
C Total 


Whole-Model Test 
ET -LogLikelihood 

7 88.9225 

5114 1190.6267 

5121 1279.5492 

RSquare (U) 

Observations 


C di i Square 
177.8451 


0.0695 

5122 


Prob>ChiSq 

0.000000 


Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

1 ntercept 

-2.8430001 

0.0687859 

1708.3 

0.0000 

zAFQT89 

-0.6710186 

0.0765998 

76.74 

0.0000 

zSES 

-0.2383458 

0.0800828 

8.86 

0.0029 

zAge 

-0.1428139 

0.062902 

5.15 

0.0232 

TestYr 

[86-901 

+0.0128625 

0.0858087 

0.02 

0.8808 

TestYr 

188-90] 

-0.0414196 

0.0798373 

0.27 

0.6039 

HomeAgeCat 

[0/2-6+] 

+0.3225819 

0.081541 

15.65 

0.0001 

1 lomeAgeCat 
13/5-6+1 

-0.1338273 

0.0852061 

2.47 

0.1163 

Basic Analysis, 

Adding Poverty Status in the Year Before the 

HOME Inde: 


mis Scored (PreTYPov): 

Source OF 

Model 8 

Error 4655 

C Total 4663 


Whole-Model Test 
-LogLikelihood ChiSquare 

116,4719 232.9438 

1049.6688 
1166.1407 


RSquare (U) 0.0999 

Observations 4664 


Prob>ChiSq 

0.000000 


Parameter Estimates 


Term 

lntercept 

zAFQT89 

zSES 

zAge 

Estimate 

-2.5413180 

-0.5717052 

-0.1646842 

-0.0836204 

Std Error 

0.0768882 

0.0847651 

0.0848268 

0.0673282 

ChiSquare 

1092.4 

45.49 

3.77 

1.54 

Proh>ChiSq 

0.0000 

0.0000 

0.0522 

0.2142 

TestYr 

[86-90] 

0.9397 

Unstable 

+0.0068172 

0.0900515 

0.01 

TestYr 

[88-90] 

-0.0538353 

0.0851491 

0.40 

0.5272 

HomeAge 

Cat[0/2-6+] 

+0.3100371 

0.0867081 

12.79 

0.0003 
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HomeAge 


Car[3/5-6+] 

PreTYPov 

-0.0968535 

0.0892661 

1.18 

0.2779 

[No-Yes] 

-0.5366001 

0.0664395 

65.23 

0.0000 


Basic Analysis, Adding AFDC Status in ehe Year Before the HOME Index 
Was Scored (Pre TYADC): 


Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

Error 

C Total 

8 120.4866 240.9733 

5101 1150.3749 

5109 1270.8615 

RSquare (U) 0.0948 

Observations 5110 

Parameter Estimates 

0.000000 

Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Inrercept 

-2.4639335 

0.0797203 

955.26 

0.0000 

zAFQT89 

-0.5835098 

0.078223 

55.65 

0.0000 

zSES 

-0.1973545 

0.0813485 

5.89 

0.0153 

zAge 

-0.0908713 

0.0644499 

1.99 

0.1586 

TestYr[86-90] 

-0.0105341 

0.0872339 

0.01 

0.9039 

TestYr[88-90] 

HomeAge 

-0.0232495 

0.0811592 

0.08 

0.7745 

Car[0/2-6+] 

HomeAge 

+0.3429802 

0.0829005 

17.12 

0.0000 

Car]3/5-6+] 

PreTYADC 

-0.1348740 

0.0863764 

2.44 

0.1184 

[No-Yes] 

-0.5572417 

0.0680104 

67.13 

0.0000 


Basic Analysis, Adding Both Poverty and AFÜC Status in the Year Before 
the HOME Index was Scored (PreTYPuv, PreTYADC): 

Whole-Model Test 

Source DF -LogLikelihood C ’hiSquare Proh>ChiSq 

M °del 9 127.1525 2543049 0.000000 

Error 4654 1038.9883 

C Total 4663 1166.1407 

RSquare (U) 0.1090 

Observations 4664 

Parameter Estimates 

Tcrm Estimate Std Error ChiSquare Prob>ChiSq 

Inrercept -2.3642864 0.0832843 805.89 0.0000 

2.AFQT89 -0.5452068 0.0849779 41.16 0.0000 

zSES -0.1657978 0.0852414 3.78 0.0518 

zAge -0.0664416 0.0679088 0.96 0.3279 
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TestYr[86-90| 

0.9743 

LJnsrable +0.0029083 

0.0904431 

0.00 

TesrYr[88-90] 

-0.0455863 0.0856239 

0.28 

0.5944 

HomeAge 

Cat|0/2-6+] 

+0.3145455 0.087279 

12.99 

0.0003 

HomeAge 

Cat[3/5-6+] 

-0.1002522 0.0896764 

1.25 

0.2636 

PreTYADC 

[No-Yes] 

-0.3806916 0.0809799 

22.10 

0.0000 

PreTYPov 

[No-Yes] 

-0.3774093 0.0762828 

24.48 

0.0000 

The High School Sample: 




Whole-Model Test 



Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

7 26.90513 

53.81026 

0.000000 

Error 

2282 526.92206 



C Total 

2289 553.82719 

RSquare (LJ) 

0.0486 



Observations 

2290 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Inrercept 

-2.9071274 

0.1079507 

725.2 3 

0.0000 

2AFQT89 

-0.5655610 

0.1331445 

18.04 

0.0000 

zSES 

-0.3731384 

0.1355456 

7.58 

0.0059 

zAge 

-0.1569221 

0.0964674 

2.65 

0.1038 

TesrYr|86-90] 

+0.0755310 

0.1295874 

0.34 

0.5600 

TestYr[88-90] 

-0.1487970 

0.1239539 

1.44 

0.2300 

HomeAge 

Cat|0/2-6+] 

+0.3159089 

0.1236104 

6.53 

0.0106 

HomeAge 

Cat[3/5-6+] 

-0.0254850 

0.1242055 

0.04 

0.8374 


The College Sample: Omitted. The cross-sectional sample included 

only five cases of children in the bottom decile on rhe HOME index. 

DEPENDENT VARIABLE: The child was in the bottom decile on any of the 
four developmental indicators (friendliness index, difficulty index, 
motor and social development index, and behavioral problems 
index). 

SAMPLE RESTRICTIONS: None. 

additional CONTROL VARIABLES: Test year (TestYr, nominal: 1986, 
1988, or 1990). 
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Basic Analysis: 





Whole-Model Test 


Source 

DF -LogLikclihood 

ChiSquare 

Prob>ChiSq 

Model 

5 .35.5004 

71.00086 

0.000000 

Error 

4885 1534.3911 



C Total 

4890 1569.8915 




RSquare (U) 

0.0226 



Observations 

4891 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-2.2678463 0.0523382 

1877.5 

0.0000 

zAFQT89 

-0.3374850 0.0666453 

25.64 

0.0000 

zSES 

-0.1454605 0.0662047 

4.83 

0.0280 

zAge 

-0.0406925 0.0531744 

0.59 

0.4441 

TestYr[86-90] 

+0.1789367 0.0698843 

6.56 

0.0105 

TestYr[88'90] 

-0.0070670 0.0677961 

0.01 

0.9170 


Basic Analysis, Adding Poverty Status and Welfare Status in the Ycar Prior 
to Testing (PreTYPov , PreTYADC) and Whether the Child was Born out 


of Wedlock (BStatus): 




Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

8 42.9933 

85.98651 

0.000000 

Error 

4329 1350.0000 



C Total 

4337 1392.9933 




RSquare (U) 

0.0309 



Observations 

4338 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-2.00631470 0.0860525 

543.59 

0.0000 

zAFQT89 

-0.25174490 0.075657 

11.07 

0.0009 

zSES 

-0.13270420 0.0708367 

3.51 

0.0610 

zAge 

+0.01726122 0.0575776 

0.09 

0.7643 

Test Y r[86-90] 

+0.18566228 0.0735475 

6.37 

0.0116 

TestYr [88-90] 

+0.01877328 0.072493 

0.07 

0.7957 

PreTYADC 




[Yes-No] 

+0.13056720 0.0820003 

2.54 

0.1113 

PreTYPov 




[Yes-No] 

+0.24199190 0.0714256 

11.48 

0.0007 

BStatus 




[Illegit-Legit] 

+0.01707707 0.0764089 

0.05 

0.82 31 
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The High School Sample: 

Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

5 13.59824 

27.19647 

0.000052 

Error 

2181 704.58153 



C Total 

2186 718.17976 




RSquare (U) 

0.0189 



Observations 

2187 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-2.3178135 0.0834745 

770.99 

0.0000 

zAFQT89 

-0.3193097 0.1100786 

8.41 

0.0037 

zSES 

-0.3161019 0.1 113263 

8.06 

0.0045 

zAge 

+0.0231487 0.0778738 

0.09 

0.766.3 

TestYr[86-90] 

+0.1566625 0.1029997 

2.31 

0.1283 

TestYr[88-90] 

+0.0136187 0.0996255 

0.02 

0.8913 

The College Sample: 




Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

5 5.166097 

10.33219 

0.066352 

Error 

346 74.39592 3 



C Total 

351 79.562020 




RSquare (U) 

0.0649 



Observations 

352 



Parameter Estimate; 

s 


Term 

Estimate Srd Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-3.0081530 0.530244 

32.18 

0.0000 

zAFQT89 

+0.78938018 0.3581312 

4.86 

0.0275 

zSES 

-0.80898430 0.3371107 

5.76 

0.0164 

zAge 

Unstable +0.01498142 

0.2822683 

0.00 

0.9577 




TestYr[86-90] 

+0.41149788 0.3719686 

1.22 

0.2686 

Te.stYr[88-90] 

-0.34603300 0.3626176 

0.91 

0.3399 


DEPENDENT VARIABLE: The child was in the bottom decile on the 
Peabody Picture Vocabulary Test (PPVT). 

SAMPLE RESTRICTIONS: Includes only children tested at age 6 and older. 
ADDITIONAL CONTROL VARIABLES: Test year (TestYr, nominal: 1986, 
1988, or 1990) and age at which the child was tested (continuous, in 
months. m = 107.0, s = 27.1) 



618 Appendix 4 


AppeyuLix 4 619 


Basic AnHrsis; 




Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

6 

24.69587 

49.39173 

0.000000 

Error 

640 

186.29121 



C Total 

646 

210.98708 




RSquare 

(U) 

0.1170 



Observations 

647 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Proh>( IhiSq 

Inrercepr 

-2.21570603 

0.8589707 

6.65 

0.0099 

zAFQT89 

-1.11994138 

0.1950498 

32.97 

0.0000 

:SES 

-0.08185312 

0.1820132 

0.20 

0.6529 

zArc 

-0.02769682 

0.1856376 

0.02 

0.8814 

PPVTAge 

-0.00466266 

0.0077779 

0.36 

0.5489 

TestYr[86'90] 

-0.16528217 

0.2424523 

0.46 

0.4954 

TestYr[88-90] 

-0.07970146 

0.2250145 

0.13 

0.7232 

Basic Analysis 

A clding Poverty Status in the Year Prior to the 

PPVT 

(PreTYPov) and the HOME Index Score Expressed in Standard Scores 

(zHOME): 






Whole-Model Test 


Source 

DF -LogLikelihocxl 

ChiSquare 

Proh>ChiSq 

Model 

8 17.72094 

35.44187 

0.000022 

Error 

582 153.59135 



C Total 

590 17 

1.31229 




RSquare (U) 


0.1034 



Observations 


591 



Parameter Estimates 


Term 

Esrimare 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-2.1291342 

0.9565224 

4.95 

0.0260 

zAFQT89 

-1.0337219 

0.2205373 

21.97 

0.0000 

zSES 

-0.0861738 

0.2093703 

0.17 

0.6806 

zAge 

+0.0296014 

0.2145926 

0.02 

0.8903 

TestYr[86-90] 

-0.2286597 

0.2683805 

0.73 

0.3942 

Test Y r[88-90] 

-0.0483017 

0.2526343 

0.04 

0.8484 

PPVTAge 

-0.0067930 

0.0085395 

0.63 

0.4263 

zHOME 

-0.1945375 

0.1842224 

1.12 

0.2910 

PreTYPov 





[No-Yes] 

+0.0903504 

0.2016689 

0.20 

0.6541 


The High School Sample: 

Whole-Model Test 


Source 

DF -L 

-ogLikelihüod 

ChiSquare 

Prob>ChiSq 

Model 

6 7.225514 

14.45103 

0.024984 

Error 

254 68.236589 



C Total 

260 75.462103 




RSquare (U) 


0.0958 



Observations 


261 



Parameter Estimates 


Term 

Estimate 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-0.2705795 

1.4383055 

0.04 

0.8508 

zAFQT89 

-0.9296387 

0.3952333 

5.53 

0.0187 

zSES 

-0.0918753 

0.3493501 

0.07 

0.7926 

zA K e 

+0.9267613 

0.41 37423 

5.02 

0.0251 

TesrYr[86-90| 

-1.0895230 

0.4214316 

6.68 

0.0097 

TestYrl 88-90| 

+0.3167489 

0.3591565 

0.78 

0.3778 

PPVTArc 

-0.0287800 

0.0132748 

4.70 

0.0302 


The College Sample: Omitred. No cases of a child age 6 or older in the 
bottom decile on the PP VT in the cross-sec tional sample. 

Wotticti tmh Less Than a High School Education: 

Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

6 6.239129 

12.47826 

0.052111 

Error 

139 70.537266 



C Total 

145 76.776395 




RSquare (U) 


0.0813 



Observations 


146 



Parameter Estimates 


Tenn 

Estimate 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-3.6384326 

1.460085 

6.21 

0.0127 

zAFQT89 

-0.8396200 

0.3093784 

7.37 

0.0066 

zSES 

+0.0090359 

0.2914784 

0.00 

0.9753 

-Arc 

-0.4386216 

0.3016201 

2.11 

0.1459 

TestYrl 86-90| 

+0.3771342 

0.3759239 

1.01 

0.3158 

TestYr[88-901 

-0.4974755 

0.3638669 

1.87 

0.1716 

PPVTAge 

+0.0133480 

0.011851 

1.27 

0.2600 
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CHAPTER 11 s CRIME 

DEPENDENT VARIABLE: The subject was in the top decile on an index of 
self-reported crime. 

SAMPLE RESTRICTIONS: lncludes only men. 

Basic Analysis: 



Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 10.02735 

20.05469 

0.000165 

Error 

2004 649.74218 



C Total 

2007 659.76953 




RSquare (U) 

0.0152 



Observations 

2008 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-2.22005314 0.0807852 

755.20 

0.0000 

zAFQT89 

-0.26980189 0.0902397 

8.94 

0.0028 

zSES 

+0.13972790 0.0979853 

2.03 

0.15 39 

zAge 

-0.20372081 0.080365 

6.43 

0.0112 

The High School Sample: 




Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 4.28228 

8.564558 

0.035677 

Error 

661 201.83770 



C Total 

664 206.11998 




RSquare (U) 

0.0208 



Observations 

665 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

Intercepr 

-2.35032467 0.1445857 

264.24 

0.0000 

zAFQT89 

+0.2120838 0.2006406 

1.12 

0.2905 

zSES 

+0.3653400 0.1981511 

3.40 

0.0652 

zAjje 

-0.26122639 0.1457019 

3.21 

0.0730 

The College 

Sample: 




Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 2.959829 

5.919657 

0.115585 

Error 

276 46.577870 



C Total 

279 49.537698 




RSquare (U) 

0.0597 



Observations 

280 



Term 

Intercept 

ZÄFQT89 

zSES 

zAge 


Parameter Estimates 

ChiSquare Proh>ChiSq 


Esrimate Std Error 
-3.33070801 0.7047663 
-0.63468357 0.5194501 
+0.80027390 0.4591207 
+0.39913230 0.306701 


22.33 

0.0000 

1.49 

0.2218 

3.04 

0.0813 

1.69 

0.1931 


DEFENDENT VARIABLE: The subject was interviewed in a correctional fa- 
cility in one or more interviews from 1979 to 1990. 

SAMPLE RESTRICTIONS: lncludes only men. 


Basic Analysis: 

Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 23.31444 

46.62887 

0.000000 

Error 

1941 219.90125 



C Total 

1944 243.21569 




RSquare (U) 

0.0959 



Observations 

1945 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

-3.77716689 0.1717938 

483.41 

0.0000 

zAFQT89 

-0.89666260 0.1753619 

26.14 

0.0000 

zSES 

-0.15554116 0.1806149 

0.74 

0.3891 

zAge 

+0.0782992 0.1468634 

0.28 

0.5939 

The High School 

Sample: 




Whole-Model Test 


Source 

DF -LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 4.058464 

8.116928 

0.043656 

Error 

712 39.850585 



C Total 

715 43.909049 




RSquare (U) 

0.0924 



Observations 

716 



Parameter Estimates 


Term 

Estimate Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

-4.96578763 0.4806319 

106.75 

0.0000 

zAFQT89 

-1.07006679 0.443121 

5.83 

0.0157 

zSES 

-0.16211965 0.4642977 

0.12 

0.7270 

zAge 

+0.46727190 0.367754 

1.61 

0.2039 


The College Sample: Omitted. No one in the cross-sectional College 
Sample was ever interviewed in jail. 
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CHAPTER 12: CIV1L1TY AND CITIZENSHIP 

DEFENDENT VARIABLE: Did the subject score “yes” on the Middle Class 
Values Index? 

SAMPLE RESTRICTIONS: Excludes nevenmarried persons who met all che 
other conditions of the index and men who were physically unahle 
to work or not in the labor force because they were aftending school. 


Basic Analysis: 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

161.7136 

32 3.4273 

0.000000 

Error 

3025 

1937.4328 



C Total 

3028 

2099.1465 




RSquare (U) 

0.0770 



Observations 

3029 



F 

Gramerer Estimates 


Term 

Estimate 

Std Errur 

ChiSquare 

Proh>( ,'hiSq 

Intercept 

-0.06385330 

0.038934 

2.69 

0.1010 

zAFQT89 

+0.63250551 

0.0528176 

143.41 

0.0000 

zSES 

+0.24495537 

0.0520624 

22.14 

0.0000 

zAge 

+0.00663732 

0.0401929 

0.03 

0.8688 

The High School Sample: 





Whole-Model Test 



Source 

DF 

-LogLikelihood 

ChiSquare 

Proh>ChiSq 

Model 

3 

3.00926 

6.018528 

0.110712 

Error 

1158 

781.85686 



C Total 

1161 

784.86612 




RSquare (U) 

0.0038 



Observations 

1162 



Parameter Estimates 


Term 

Esrimare 

Std Error 

ChiSquare 

Proh>ChiSq 

Intercept 

+0.39447706 

0.0611821 

41.57 

0.0000 

zAFQT89 

+0.16814512 

0.0931181 

3.26 

0.0710 

zSES 

-0.17993040 

0.0903402 

3.97 

0.0464 

zAge 

+0.01887678 

0.0621776 

0.09 

0.7614 


The College Sample: 

Whole-Model Test 


Source 

DF 

-LogLikelihood 

ChiSquare 

Prob>ChiSq 

Model 

3 

3.26859 

6.537177 

0.088208 

Error 

398 

200.09145 



Ü Total 

401 

203.36004 




RSquare (LJ) 

0.0161 



Observations 

402 



Parameter Estimates 


Term 

Esrimare 

Std Error 

ChiSquare 

Prob>ChiSq 

Intercept 

+0.99516202 

0.2386798 

17.38 

0.0000 

zAFQT89 

+0.39251349 

0.1988073 

3.90 

0.0483 

zSES 

+0.03692158 

0.168585 

0.05 

0.8266 

zAge 

+0.13876137 

0.1336384 

1.08 

0.2991 
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Supplemental Material for 
Chapter 13 


Three issues raised in Chapter 13 are elaborated here: a more detailed 
discussion of cultural bias, more evidence for the narrowing of the black' 
whitedifference incognitiveability, and rhe broaderargument forracia) 
differences advanced by Philippe Rushton, 

MORE ON TEST BIAS 

ln Chapter 13, we reported that the scientific evidence demonstrates 
overwhelmingly that Standardized tests of cognitive ability are not bi' 
ased against blacks. Here, we elahorate on the reasoning and evidence 
that lead to that conclusion. 

More on Extemal Evidence of Bias: Predictive Validity 

Everyday commentary on test hias usually Starts with the observation that 
members of various ethnic (or socioeconomic) groups have different av' 
erage scores and leaps to the assumption that a group difference is prima 
facie evidence of hias. But a moment s thought should convince anyone 
that this is not necessarily so. A group difference is, in and of itself, evi' 
dence of test bias only if we have some reason for assuming that an um 
biased test would find no average difference between the groups. What 
might such a reason be ? We cast the answer in terms of whites and blacks, 
since that is the context for most charges of test bias. Inasmuch as the com 
text also usually involves a criticism of the use of the test in selection of 
persons for school or job, the most pertinent reason for assuming equality 
in the absence of test bias would be that we have other data showing that 
a randomly selected black and white with the same test score have differ¬ 
ent outcomes. This is what the text refers to as extemal evidence of bias. 
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If for example, blacks do better in school than whites after choosinq 
blacks and whites with equal test scores, we could say that rhe test was 
biased against blacks in academic predicrion. Similarly, if they do bet' 
ter on the job after choosing blacks and whites with equal test scores, 
the test could be considered biased against blacks for predicting work 
performance. This way of demonstrating bias is tantamount to showing 
that the regression of outcomes on scores dififers for the two groups. On 
a test biased against blacks, the regression intercept would be higher for 
blacks than whites, as illustrated in the graphic below. Test scores uiv 

When a test is biased because it systematically underpredicts one 
group’s performance 

Outcome measure 



der these conditions would underestimate, or “underpredict,” the per¬ 
formance outcome of blacks. A randomly selected black and white with 
the same IQ (shown by the vertical broken line) would not have equal 
outcomes; the black would outperform the white (as shown by the hor¬ 
izontal broken lines). The test is therefore biased against blacks. On an 
unbiased test, the two regression lines would converge because they 
would have the same intercept (the poinr at which rhe regression line 
crosses the vertical axis). 

Rur the graphic above captures only one of the many possible mani- 
festations of predictive bias. Suppose, for example, a test was less valid 
for blacks than for whites. 1 ' 1 In regression terms, this would translate into 
a smaller coefficient (slope in these graphics), which could, in turn, be 
associated either with or withour a difference in the intercept. The next 
figure illustrates a few hypothetical possibilities. 

All three black lines have the same low coefficient; they vary only 


When a test is biased because it is a less valid predictor of perfor^ 
mance for one group than another 

Outcome measure 



in their intercepts. The gray line, representing whites, has a higher co¬ 
efficient (therefore, the line is steeper). Begin with the lowest of the 
three black lines. Only at the very lowest predictor scores do blacks score 
higher than whites on the outcome measure. As the score on the pre¬ 
dictor increases, whites with equivalent predictor scores have higher 
outcome scores. Here, the test bias is against whites, not blacks. For the 
intermediate black line, we would pick up evidence for test bias against 
blacks in the low ränge of test scores and bias against whites in the high 
ränge. The top black line, with the highest of the three intercepts, would 
accord with bias against blacks throughout the ränge, but diminishing 
in magnitude the higher the score. 

Readers will quickly grasp that test scores can predict outcomes dif- 
ferently for memhers of different groups and that such differences may 
justify claims of test bias. So what are the facts? Do we see anything like 
the first of the two graphics in the data—a clear difference in intercepts, 
to the disadvantage of blacks taking the test? Or is the picture cloudier— 
a mixture of intercept and coefficient differences, yielding one sort of 
bias or another in different ranges of the test scores? When questions 
about data come up, cloudier and murkier is usually a safe bet. So let us 
Start with the most relevant conclusion, and one about which there is 
virtual unanimity among students of the subject of predictive bias in 
testing: No one hos found statistically reliable evidence of predictive bias 
against blacks, of the sort illustrated in the first graphic , in large, rejrresenta - 
tive samples of blacks and whites, where cognitive ability tests are the predic¬ 
tor variable for educational achievernent or job performance. In the notes, 
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we list some of the larger aggregations of data and comprehensi ve analy- 
ses substantiating this conclusion.“ We have found no modern, empiri- 
cally hased survey of the literature on test bias arguing that tests are 
predictively biased against blacks, although we have looked for them. 

When we tum to the hundreds of smaller studies that have accu- 
mulated in the literature, we find examples of varying regression coef- 
ficients and intercepts, and predictive validities. This is a fundamental 
reason for focusing on syntheses of the literature. Smaller or unrepre- 
sentative individual studies may occasionally find test bias becau.se of 
the Statistical distortions that plague them. There are, for exatnple, sam¬ 
pling and measurement errors, errors of recording, transcribing, and 
computing data, restrictions of ränge in both the predictor and outcome 
measurements, and predictor or outcome scales that are less valid than 
they mighr have been.’ Given all the distorting sources of Variation, 
lack of agreement across studies is the rule. 

But even taken down to so fine a level, the case against predictive 
bias against blacks remains overwhelming. As late as 1984, Arthur 
Jensen was able to proclaim that “1 have not come across a bona f'ide ex- 
ample of the opposite finding [of a test that underpredicts black perfor- 
mance].” 4 Jensen’s every finding regarding racial differences in IQ is 
routinely subjected to inrense scrutiny by his critics, Hut no one bas con- 
tradicted this one. We are not absolutely sure that our literature review 
has identified every study since 1984, but our search revealed no ex¬ 
amples to counter Jensen’s generalization. |5i 

Insofar as the many individual studies show a pattem at all, ir points 
to overprediction for blacks. More simply, this body of evidence suggests 
that IQ tests are biased in favor of blacks, not against them. The single 
mosr massive set of data bearing on this issue is the national sample of 
more than 645,000 school children conducted by sociologist James 
Coleman and his associates for their landmark examination of the 
American educational System in the mid-1960s. Coleman’s survey in- 
cluded a standardized test of verbal and nonverbal IQ, using the kinds 
of items that characterize the classic IQ test and are commonly thought 
to be culturally biased against blacks: picture vocabulary, sentence com- 
pletion, analogies, and the like. The Coleman survey also included ed- 
ucational achievement measures of reading level and math level that 
are thought to be straightforward measures of what the Student has 
leamed. If IQ items are culturally biased against blacks, it could be pre- 
dicted that a black Student would do berter on the achievement mea- 


sures than the putative IQ measure would lead one to expect (this is the 
rationale behind the current popularity of Steps to modify the SAT so 
rhat it focuses less on aptitude and more on measures of what has been 
learned). But the opposite oecurred. Overall, black IQ scores overpre- 
dicted black academic achievement by .26 Standard deviations. 6 

One inference that might be drawn frotn this finding is that black 
children were for some reason not taking as much from school as their 
ability would permit, or that black children went to worse schools than 
white children, or any of several other interpretations. But whatever the 
explanation might be, the results directly contradict the hypothesis that 
IQ tests give an unfairly low estimate of black academic pcrformance, 

A second inajor source of data suggesting that standardized tests oven 
predict black performance is the SAT. Colleges commonly compare the 
pcrformance of freshmen, measured by grade point average, against the 
expectations of their performance as predicted by SAT scores. A liter- 
ature review of studies that broke down these data by ethnic group re¬ 
vealed that SAT scores overpredieted freshman grades for blacks in 
fourteen of fifteen studies, by a median of .20 Standard deviation . 7 In 
five additional studies where the ethnic classification was “minority” 
rather than specifically “black,” the SAT score overpredieted College 
performance in all five cases, by a median of .40 Standard deviation.* 

For job performance, the most thorough analysis is provided by the 
Harngan Report, assessing the relationship betvveen the General Apti- 
tude Test Battery (GATB) and job performance measures. Out of sev- 
enty-two studies that were assembled for review, the white intercept was 
higher than the black intercept in sixty of them—that is, the GATB 
overpredieted black performance in sixty out of the seventy-two stud- 
ies. 9 Of the twenty studies in which the intercepts were statistically sig- 
nificantly different (at the .01 level), the white intercept was greater 
than the black intercept in all twenty cases. 10 

These findings about overprediction apply to the ordinary outcome 
measures of academic and job performance. But it should also be noted 
that “overprediction” can be a misleading concept when it is applied to 
outcome measures for which the predictor (IQ, in our continuing ex- 
arnple) has very low validity. Inasmuch as blacks and whites differ on 
average in their scores on some outcome that is not linked to the pre- 
dictor, the more biased it will be against whites. Üonsider the next fig- 
ure, constructed on the assumption that the predictor is nearly invalid 
and that the two groups differ on average in their outcome levels. 
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A predictor with low validity may seem to be biased against whites 
if there is a substantial difference in the outcome measure 


Outcome measure 
High | 



Predictor 


This Situation is relevant to some ofthe outcome measures discussed 
in Chapter 14, such as short-term male unemployment, where the black 
and white means are quite different, but IQ has little relarionship to 
short-term unemployment for either whites or hlacks. This figure was 
constructed assuming only that there are factors influencing outcomes 
that are not captured by the predictor, hence its low validity, resulting 
in the low slope of the parallel regression lines.“ 11 The intercepts difter, 
expressing the generally higher level of performance by whites com- 
pared to blacks that is unexplained by the predictor variable. If we knew 
what the missing predictive factors are, we could include them in the 
predictor, and the intercept difference would vanish—and so would the 
implication that the newly constituted predictor is biased against 
whites. What such results seem to be telling us is, first, that IQ tests are 
not predictively biased against blacks but, second, that IQ tests alone 
do not explain the observed black-white differences in outcomes. 1t 
therefore ölten looks as if the IQ test is biased against whites. 

More on Internal Evidence of Bias: Item Analysis 

Laymen are often skeptical that IQ fest itcms could measure anything 
as deep as Intelligence. Knowing the answers seems to them to depend 
less on intelligence than on having been exposed to certain kinds of cul- 
tural or historical Information. It is usuaily a short step from here to the 
conclusion that the tests must be biased. Pundits of varying sorts rein- 
force this intuition about test item bias, claiming that the middle- and 
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upper-class white culture infuses test items even after vigorous efforts tu 
expunge it. 

The data confirming Spearman’s hypothesis, which wc discussed at 
some length in Chapter 1.3, provide the mosr convincing conceptuai 
refutation of this allegation by providing an alternative explanation that 
has been horne out by many studies: the items on which blacks and 
whites differ most widely are not those with the mosr esoteric cultural 
content, hut the ones rhat best measure the general intelligence factor, 
g. 12 Bur many other studies have directly asked whether the cultural con- 
tent of items is associated with the magnitude of the black-white dit- 
(erence, which we review here. 

One of the earliest of the studies, a 1951 doctoral thesis at Catholic. 
University, proceeded on the assumption that some test items are more 
dependent on exposure to culture rhan others. 1 ' Frank McGurk, the 
study’s author, conscquently had large numbers of independent judges 
rate many test items for their cultural loading. On exploratory tests, he 
was ahle to establish each item’s general diff iculty, which is defined sim- 
ply as the proportion of a population that gets the item wrong. He could 
therefore identify pairs of items, one highly loaded with cultural infor- 
mation and rhe orher not highly loaded hut of equal difficulty. Now, fi¬ 
nal ly, the crucial evaluation could be made with a sample of black and 
white high school students matched for schooling and socioeconomie 
hackground. The black-white gap, he discovered, was abour twice as 
large on items rated as low in cultural loading as on items rated as high 
in cultural loading. Consider, for example, a pair of equally difficult tesr 
items. The one that is culturally loaded is prohably difficult hecause it 
draws on esoteric knowledge; the other item is prohably difficult hecause 
it calls on complex cognitive processing— g. Mcüurk’s results under- 
mined the proposition that access to esoteric knowledge was to blame 
for the black-white difference. 

Another approach in the pursuit of test-item hias is based on which 
items blacks and whites find hard or easy. Conceptually, this is much 
like McGurk s approach, except that it does not require us to have items 
rated by experts, a subjective procedure that some might find suspect. 
Instead, if the cultural influence matters and if blacks and whites have 
access to different cultural backgrounds, then items that pick up these 
cultural differences should split the two groups. Items drawing on cul- 
tural knowledge more available to whites than to blacks should be, on 
average, relatively easier for whites than for hlacks. Items lacking this 
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tip for whites or items with a tip for blacks should not he differentially 
easier for whites and may be easier for blacks. 

This idea is tested by ranking the items on a test separately for whites 
and for blacks, in Order of diffieulty. That is, the easiest item for whites 
is the eine with the highest Proportion of correct answers among whites; 
the next easiest item for whites is the one with the second highest Pro¬ 
portion of correct answers for whites; and so on. Now repeat the proce- 
dure using the blacks’ proportions of correct answers. This will result in 
two sets of rank Orders for all the items. The rank-order correlation he- 
tween them is a measure of the test-item bias hypothesis: The larger the 
correlation is, the less Support the hypothesis finds. Alternarively, ihc 
proportions of correct responses within each group are transformed into 
Standard scores and then correlated by sorne other measure of correla¬ 
tion, such as the Pearson product-moment coefftcient. 

Either way, the result is clear. Relative item difficulties are essentially 
the same for hoch races (by sex). That is, blacks and whites of the samt' 
sex come close to f inding the same item the easiest, the same item next 
easiest, all the way down to the hardest item. When the rank order 
of diffieulty differs across races, the differences tend to he small and utv 
systematic. Rank order correlations above .95 are not uncommon for 
rhe items on the Wechsler and Stanford-Binet tests, which are, in fact, 
the tests that provide most of the anecdotal material for arguing that 
test items are biased. Pearson correlations are often somewhat lower hur 
typically still above .8. Moreover, when items do vary in diffieulty across 
races, most of the Variation is eliminated by taking mental age into ac- 
count. Since blacks and whites of the same chronological age differ on 
average in mental age, allowing a compensating lag in chronological age 
will neutralize the contribution of mental age. Compare, say, the item 
difficulties for 10-year-old blacks with that for 9-year-old or 8-year-oId 
whites. When this is done, the correlations in diffieulty almost all rise 
into the .9 ränge and above. IS 

Because Atem bias” ordinarily defined has failed to materialize, rhe 
concept has been extended to encompass item charactenstics that are 
intertwined with the underiying rationale for thinking that an item mea- 
suresg. For example, one researcher has found that the black-white gap 
is diminished for items that call for the subject to identify the one false 
response, compared ro items requiring the subject to identify the one cor¬ 
rect response. Ih Is this a matter of bias, or a matter of how well the two 
types of items tap the construct called intelligence? This in turn brings 


us full circle to Spearman’s hypothesis discussed in Chapter 13, which 
öfters an interpretative framework for explaining such differences. 

More on Other Potential Sowrces of Bias 

We turn now to one of the least precisely hut most commonly argued 
reasons for thinking that tests are biased: Tests are a sort of game, and, 
as in most games, it helps to have played the testing game, it helps to 
get coaching, and it helps to be playing on the Home field. Privileged 
groups get more practice and coaching than underprivileged groups. 
They have a home-court advantage; the tests are given in familiär en- 
vironments, administered by familiär kinds of people. A rnajor pari of 
the racial differences in test scores may be attributed to these differences. 
ln this discussion, we begin with coaching and practice, then turn to 
some of the other ways in which the testing Situation might influence 
scores. 

PRAtrriU: AND t'OACHINU. For IQ tests, coaching and practice are not 
a significant issue because coaching and practice effects exist only un- 
der conditions that virtually never apply. To get a sizable practice effect 
for an IQ test, it is necessary to use subjects who have never taken an 
lQ-like test, administer the identical test twice, and do so quickly (prefer- 
ably within a few weeks). 17 If the subjects fail to meet any of those con¬ 
ditions, the chances offinding a practice effect are small, and the size 
of any effect, if one is found, will be just a few points. Coaching effects 
are even harder to obtain. We are unable to identify any IQ data in any 
study, large or small, in which the results are compromised because the 
IQ scores of part of the sample have been obtained after this kind of ex- 
perience. That’s not the way that IQ tests have been administered any- 
where ro any significant sample at any time during the history of IQ 
testing—except to the samples used to assess practice and coaching ef¬ 
fects, and sometimes to the subjects of intensive remedial programs such 
as those discussed in Chapter 17. 

The stury regarding practice and coaching for such tests as the 
Scholastic. Aptitude Test (SAT), the Law School Admission« Test 
(LSAT), and the Medical College Admissions Test (MCAT) is much 
more contentious than the story about IQ. Many people do take these 
tests more than once, many people practice for them, and many people 
get extensive coaching. Moreover, these tests are supposed to he coach- 
able,” insofar as they measure the verbal, reasoning, and analytic skills 
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that a good education is supposed to enhance, and prolonged exposure 
to such coaching should produce better scores. Or to put it another way, 
two students with the same IQ should be able to get different LSAT and 
MCAT scores if one Student has taken more appropriate courses and 
studied harder than the other Student. That SAT scores declined by ah 
most half a Standard deviation front 1964 to 1980 strongly suggests that 
something coachable—or “negatively coachable” in this example—is 
heing measured. In Chapter 17, vve discuss the effects of coaching for 
the SAT V which are real but also smaller and harder to obtain than the 
widely advertised clairns of the coaching industry. 

The belief that coaching might explain part of the black-white gap 
offen rests on a notion that, on the average, hlacks receive less of the 
practice and coaching that might have elevated their scores than does 
the average white. We have already undermined this notion by show- 
ing that the tests are biased against blacks neither predictively nor in 
terms of particular item difficulties. There is, however, a literature that 
bears more directly on this idea, by looking for an interaction effect be- 
tween practice or coaching and race. 

If practice and coaching explain any portion of a group difference in 
scores in the population as a whole, then it neeessarily follows that rep- 
resentative samples of those groups who are equally well practiced and 
well coached will show a smaller difference than is observed in the pop¬ 
ulation ar large. It is not enough that practice or coaching raises the 
mean score of the lower-scoring group; it must raise its mean score more 
than it raises the score of the higher-scoring group. 

Several studies have investigated wherher this is found for blacks and 
whites. In a welbdesigned study, representative samples of hlacks and 
whires are randomly divided into two groups. The experimental black 
and white groups receive identical coaching (or practice), and the Con¬ 
trol groups receive no treatment at all. At the end of the experiment, 
the invesrigator has four different sets of results: test scores for coached 
hlacks, uncoached blacks, coached whites, and uncoached whites. 
These results may be analyzed in three basic ways: One may compare 
blacks overall with whites overall, which will reveal the ma/n effect of 
race; or the coached samples overall with the uncoached samples over¬ 
all, which will reveal the main effect of the coaching; or the way in which 
the effects of coaching vary according to the race of the persons being 
coached, known as the interaction effect. 

One study found a statistieally significant differential response to 


practice, but not to direct instruction, on a reasoning test, between 
black and white College students. 18 The differential advantage of prac¬ 
tice for hlacks compared to whites was about an eighth of the overall 
hlack-white gap on this test. Other studies have failed to find even this 
tnueh of a differential response, or they have found differential re- 
sponses in the opposite direction, tending to increase the black-white 
gap after practice. w Taking the evidence as a whole, any differential 
coaching and practice effects by race (or socioeconomic Status) is at 
most sporadic and small. If such a differential effect exists, it is too small 
to be replicated reliably. The scattered evidence of a differential effect 
is about as supportive of a whire advantage from coaching as of a black 
advantage. 

EXAMINER EFFECTS AND OTHER SITUATION AL VARIABLES. Is it possible 
that disadvantaged groups come to the test with greater anxiety than 
confident middle-class students, and this mental state depresses their 
scores? That, when a black Student takes a bus across town to an unfa- 
miliar neighborhood and goes into a test ing roorn filled with white stu¬ 
dents and overseen by a white test Supervisor, this Situation has an 
intimidating effect on performance? What about the time limits on 
tests? Might these have more pronounced effects on disadvantaged sru- 
dents than on test-wise middle-class students? All are plausible ques- 
rions, but the answer to each is the same: Investlgations to date give no 
reason to believe that such considerations explain a nontrivial portion 
of the group differenc.es in scores. 

The race of the examiner has been the subject of numerous studies. 
Of those with adequate experimental designs, most have showed non- 
significant effects; of the rest, the evidence is as streng that the pres- 
ence of a white examiner reduces overall black-white difference as that 
a white examiner exacerbates the difference. 20 Examinations of the re¬ 
sults of time pressures fail to demonstrate either that blacks do better in 
untimed than in timed tests or that the test-taking “personal tempo” of 
blacks is different from that of whites. 21 Test anxiety has heen investi¬ 
gated extensively hut, as in so many other aspects of this discussion, the 
relationship tends to be the opposite of the expected one: To the extent 
that test anxiety affects performance at al l, it seems to help slightly. Only 
a few studies have specifically addressed black-white differences in test 
anxiety; they have shown either nonsignificant results, or that the white 
subjeers were slightly more anxious than the black suhjects. 22 
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“BLACK EnGLISH.” Language looms larger. It is well established that the 
students from many different cultural backgrounds for whom English is 
a second language tend to score better on the nonverbal part of the tesr 
than a verbal component given in English. 21 Whereas this imhalance 
may be independent of language for East Asians (Japanese in Japan have 
Superior nonverbal scores even taking verbal test batteries designed in 
Japanese), it is also manifest among Latinos, who do not otherwise ex- 
hibit the characteristic East Asian verbal-nonverbal pattem. This sug- 
gests that students who are taking the test in a second language suffer 
some decrement of their scores. 

It has been a small step from this to hypothesize that, for pracrical 
purposes, many blacks are taking the test in a “second language,“ with 
their first language being the dialect known as “black English,” uhiqui- 
tous in the black inner city and used to some extent by blacks of broader 
socioeconomic backgrounds. Researchers have approached the issue in 
several ways. First, the evidence indicates that black children who use 
black English understand Standard English at least as well. 24 A more di¬ 
rect test came in the 1970s, when L. C. Quay had the Stanford-Binet 
translated into black dialect and tested several samples with both the 
original and the revised version. The studies produced no evidence that 
black students in any of the various test groups benefited (the differ- 
ences in scores from the two tests generally amounted to less than one 
IQ point). 25 But the most powerful data suggesting that language does 
not explain the black-white difference is provided by the evidence for 
Spearman’s hypothesis presenred in Chapter 13: If language were the 
problem, then. blacks would be at the greatest disadvantage on test items 
that rely on a knowledge of Standard English and be at the least disad- 
vantage on test items that use no language at all. As we discuss with re- 
gard to Spearmans hypothesis in Chapter 13, this expectation is 
contradicted by a large and consistent body of work. Black populations 
generally do relati vely better on test items that are less saturated with g 
and relatively worse on items more saturated withg, whether the items 
are verbal or nonverbal. 

The Continuing Debate 

Allegations that standardized tests are culturally biased still appear, and 
presumably this account will fuel additional ones. What about all the 


articles appearing in many quarters making these claiins? They make up 
a varied lot, but typically consist of allegations that ignore the data. A 
particularly striking example was a long article entitled “IQ and Stan¬ 
dard English,“ which appeared in a technical journal and attributed the 
hlack-white IQ test differences to language difficulties. The article was 
tollowed by tour responses, plus by a counterstatement by the author. 
Neither the original article nor any of the responses cited any of the 
data discussed above. 26 The debate was carried on enrirely on the basis 
of argumentation about the extent to which black culture is more orally 
based than white culture. This readiness to theorize ahout what might. 
be true about black-white differences in test scores while ignoring the 
pertinent data is common. 

Other articles, cited in the note, have discussed a variety of ways in 
which culture Internets with human functioning, intellectual and oth¬ 
erwise. 2 ^ The movement surrounding Howard Gardners concept of 
multiple intelligentes (see the Introduction) is only the best known of 
these new ways of talking about intclligence. But these discussions do 
not try to argue with the two core Statements that we have made: In the 
major standardized tests, test items funcrion in the same way for both 
blacks and whites, and the tests results are similarly predictive for blacks 
and whites, tending to overpredict black perforinance rather than uiv 
derpredict it. 

In the populär media, the persistence of belief in cultural bias, we 
think, is based on a misapprehension. To many people, proof that tests 
are unbiased seems tantamount to proof that the black-white gap re- 
flects genetic differences in intelligence. Since they reject the possihil- 
ity that genetic differences could be involved, the tests must be biased. 
One of the major purposes of Chapter 13 is to discredit both the no- 
tion that real differences in intelligence must be genetically founded 
and the assumption that a role for genes must have horrific con- 
sequences. 

IS THE BLACK-WHITE DIFFERENCE IN COGNITIVE ABILITY 
SHRINKING? 

The text discusses the evidence for converging black and white test 
scores on the NAEP (National Assessment of Education Progress) and 
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the SAT. Here, we summarize other sources of data about the two eth- 
nic populations. 

National High School Studies, 1972 and 1980 

In 1972 and 1980, the federal government sponsored large-sample stud- 
ies intended to provide reliable national estimates of the high school 
population. As part of both studies, tests measuring vocabulary, reading, 
and mathematics were administered to all participants. Although not 
technically IQ tests, all three had highg loadmgs. Furthermore, the tests 
were virtually identical for the two test administrations, 1 ^ 1 and the study 
procedures in 1980 were deliberately constructed to maximize the conv 
parability of the two samples. In 1982, the sophomores from the 1980 
sample were tested as seniors. The table below summarizes the results 
for the three test years hy ethnic group. The black-white difference di' 
minished on two of the three tests, but all of the shrinkage came about 
because white scores feil, not because black scores rose. Indeed, black 
scores also feil on all three tests but (except in the case of vocabulary), 
by less than the reduction in white scores. 


Black-White Difference for High School Seniors in 
1972, 1980, and 1982 

White-Black Difference, in SDs 



1972 

1980 

1982 

Vocabulary 

1.00 

.87 

1.02 

Reading 

.99 

.85 

.78 

Mach 

1.09 

.91 

.86 

Source: Rock er al, 

, 1985, Appendixe 

is R.C, E. 



College Board Achievement Tests 

Thh SAT In Chapter 1 3, we noted that the overall black-white gap in 
SAT scores had narrowed hetween 1976 and 1993, from 1.1 6 ro .88 Stan¬ 
dard deviation in the verbal portion of the test and from 1.27 to .92 
Standard deviation in the mathematics portion of the testT' More de- 
tailed breakdowns arc available for the period 1980 to 1991, as shown 
in the table below. The trend is consistently positive, wirb narrowing 


Reductions in the Black-White Difference on the 
Scholastic Aptitude and Achievement Tests, 1980-1991 

White-Black Difference, in SDs 



1980 

1991 

Change 

SAT-Verbal 

1.09 

.87 

-.22 

Reading subscore 

.93 

.85 

-.10 

Vocabulary suhscore 

1.09 

.85 

-.26 

SATMath 

1.10 

.90 

-.20 

Test of Standard writren English 
Achievement tests 

1.11 

.89 

-.22 

Overall average 

.83 

.78 

-.05 

English Composition 

.73 

.71 

-.02 

Literature 

.86 

.76 

-.10 

American History 

.69 

.69 

.00 

European History 

.81 

.56 

-.25 

Math l 

.75 

.75 

.00 

Math 11 

.98 

.85 

-.15 

Biology 

.77 

.68 

-.09 

Chemistry 

.69 

.74 

+ .05 

Physics 

.84 

.74 

-.10 

French 

.33 

.18 

-.15 

German 

.64 

.27 

-.57 

Larin 

.66 

.25 

-.41 

Spanish 

.50 

.35 

-.15 

Source The (College Board’s annual summaries u 

if test scores 1 

hy erhnieity. 


black-white differences of at least .1 Standard deviation units on the 
tests for Literature. European History, Math II, Physics, French, Oer- 
man, Latin, and Spanish. The average shrinkage of the gap is .05 Stan¬ 
dard deviation unit. From further analyses, we conclude that the 
narrowing is not entirely explained away by changes in the representa- 
tiveness of the black and white samples of test takers or hy declining 
white scores. 

To interpret the changes in scores on achievement tests, which are 
taken by small proportions of the SAT test takers, we used the mean 
that the College Board provides on the SAT Verbal and Math scores for 
each achievement test population in each year. The question we asked 
was: For a given achievement test, how did the place of the average test 
taker on his races cognitive ability distribution change from 1980 to 
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1991? For example, the average whire taking the Literature achieve- 
ment test in 1980 had an SAT Verbal score that put hi in at the 80 th 
percentile of white testees; in 1991, he was at the 85th percentile. Mean- 
while, the average black taking the Literature achievemenr test in 1980 
had an SAT Verbal score that put him at the 88th percentile of all black 
SAT testees; in 1991, he was still at the 88th percentile of the black dis- 
tribution. The difference between blacks and whites on the Literature 
achievement test narrowed during that period, but, given where the 
blacks and whites were relative to the white and black SAT distrihu- 
tions, it seems unlikely that the narrowing was caused by changes in the 
self-selection that artificially raised black scores relative to whites. Ten 
of the thirteen achievement tests fit this pattem, ln only three cases 
(European History, Physics, and German) did changes in the SAT Math 
or Verbal scores indicate that the black pool had become differential ly 
more selective. Only in the case of German was this difference large 
enough to account plausibly for much of the black improvement rela¬ 
tive to whites. 

THE ACT The College Board’s major competitor in the College en- 
trance examination business is the American College Testing program, 
which bas also shown decreasing differences between black and white 
students who take the test, as summarized in the table below. Reduc- 


Black-White Difference in the ACT, 1970-1991 
White-Black Difference, in SDs 



1970 

1991 

Change 

English 

1.14 

.83 

-dl 

Math 

.86 

.77 

-.09 

Science 

.97 

.91 

-.06 

Composite 

1.12 

.96 

-.16 


Nounv: A( H 1991, Tahles 1 4; Congrcssional Budget Office 1986, Fig. EG. 


tions in the gap occurred in all the subtests between 1970 and 1991, 
with by far the largest reduction on the English subtest. The magnitude 
of the overall change in the composite is about half the size of the re- 
duction observed in the black-white difference on the SAT Like the 
SAT population, the ACT’s population of black test takers has been in¬ 


creasing, suggesting that the increases in scores are not the result of a 
more selective test-taking population. 

The GRADUATE RECORD EXAMINATION (GRE). The GRE is the equiva- 
lent of the SAT for admission to graduate school in the arts and Sci¬ 
ences. Nor rnany people in any cohort take the GRE, so the sample is 
obviously highly self-selected and atypical of the population. In 1988, 
for example, the numher of white GRE test takers represented only 5.6 
percent of the 22-year-old white population; hlack test takers repre¬ 
sented 2.3 percent of its 22-year-old population. On the other hand, the 
proportions in 1988 were about the same as they were in 1979. The seif- 
selection process has remained fairly steady over the years, so it is worth 
at least mentioning the resuIts, as shown in the table below. The GRE 


Black-White Difference in the GRE, 1979-1988 
White-Black Difference, in SDs 



1979 

1988 

Change 

Verbal 

1.25 

1.13 

-.12 

Math 

1.28 

1.0 

-.15 

Analytical 

1.46 

1.21 

-.25 


Sourtv: Graduate Record Examination Board. 


gap narrowed only slightly less than that for the SAT. Another positive 
note is that the narrowing was achieved hecause black scores rose more 
than white scores, not hecause white scores were falling. 

These resulrs from national tests are echoed in state-level data from 
Texas and North Carolina, as reported in the Congressional Budget Of- 
fice’s survey of trends in educational achievement. H Overall, the evi¬ 
dente seems clear beyond a reasonable doubt: On College entrance tests 
and national tests of educational proficiency, the gap between whites 
and blacks remained large into the early 1990s, hut it had been nar¬ 
rowing in the preceding decade or two. The Optimist may argue that the 
trend will continue indefinitely if improvements in the environment 
and education for American blacks can be continued. The pessimist may 
note that there seems to have heen little narrowing since the mid-1980s, 
as we observed in the text for Chaprer 13, and that the black-white IQ 
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Pi ap in the NLSY seems to be widening rather narrowing in the next 
generation, as we discussed in Chapter 15. 

RUSHTON ON RACE DIFFERENCES AND REPRODUCT1VE 
STRATEGIES 

Controversy unprecedented even for the contentious subject of racial 
differences has erupted around the work of J. Philippe Rushton, a 
developmental psychologist at the University of Western Ontario. 
Rush ton argues that the differences in the average intelligence test 
scores among East Asians, blacks, and whites are not only primarily 
genetic but part of a complex of racial differences that includes such 
variables as brain size,’ 1 genital size, rate of sexual maturation, length 
of the menstrual cycle, frequency of sexual intercourse, gamete 
production, sexual hormone levels, the tendency to produce dizygotic 
twins, marital stability, infant mortality, altruism, law abidingness, and 
mental health. For each variable, Rushton has concluded, the three 
races—Mongoloids, Caucasoids, and Negroids—fall in a certain order, 
with the average Caucasoid in the middle and the other two races on 
one side or the other. The ordering of the races, he further argues, has 
an evolutionary basis; hence these ordered racial differences must in- 
volve genes. 

To reach his conclusion, Rushton Starts with the welhestablished 
observation in biology that species vary in their reproductive strategies. 
Some species produce many offspring (per parent) of which only a small 
fraction survive; others produce small numbers of offspring with re kn 
tively highsurvival rares. The involvementof parents in their offsprings’ 
health and development (which biologists call “parental investment”) 
tends to be high for species having few offspring and high survival rates 
and low for those employing the other strategy (many offspring and low 
survival rates). Many other species differences are concomitant with this 
fundamental one, according to Standard biological doctrine. 

Rushton’s thesis is that this Standard biological principle may he ap- 
plied within our own species. Rushton acknowledges that human he- 
ings are as a species far out along the continuum of low reproduction, 
high offspring survival, and high parental investment, but he argues that 
the ordering of the races on the many variables he has identified can be 
explained as the result of evolutionary differences in how far out the 
races are. According to Rushton, the average Mongoloid is toward one 


end of the continuum of reproductive strategies—the few offspring, high 
survival, and high parental investment end—the average Negroid is 
shifted toward the other end, and the average Caucasoid is in the 
middle. 

Rushton paints with a broad brush, focusing on the major racial cat¬ 
egories rather than the dozens of more fine ly dravvn reproductively 
isolated human populations that might test his theory more con- 
clusively. But heyond that, his thesis raises numerous questions—moral, 
pragmatic, and scientific. Many critics attack the theory on scientific, 
not just moral, grounds. They question whether Rushton has really 
shown that the races are consistently ordered in the way he says they 
are, or whether a biological theory that was meant to explain species 
differences can he properly applied to groups within a single species, or 
whether the evidence for genetic influences on his variables Stands up. 
Rushton has responded to his critics with increasingly detailed and con- 
vincing empirical reports of the race differences in some of the traits on 
his 1 ist, and he cites preeminent biological authority for his use of the 
concept of reproductive strategies. He has strengthened the case for 
consistently ordered race differences, at least for some of the variables 
he discusses, since his first formulation of the theory in 1985. Never- 
rheless, the theory remains a long way from confirmation. 

We cannot at present say who is more nearly right as a matter of Sci¬ 
ence, Rushton or his criticsd 2 However, Rushton’s work is not that of a 
crackpot or a bigot, as many of his critics are given to charging. Nor are 
we sympathetic with Rushton’s academic colleagues or rhe politicians 
in Ontario who have called for his peremptory dismissal from a tenured 
professorship. Setting aside whether his work is timely or worthwhile— 
a judgment we are loath to make under any circumstanc.es —it is plainly 
Science. He is not alone in seeking an evolutionary explanation of rhe 
observed differences among the races. 15,1 As Science, there is nothing 
wrong with Rushton’s work in principle; we expect that time will teil 
whether it is right or wrong in fact. 
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Regression Analyses from 
Chapter 14 


This appendix presenrs rhe regression analyses underlying rhe presen- 
tation in Chapter 14. 

The results in Chapter 14 and in this appendix are based on separate 
regressions for each of the three ethnic groups in question (black, 
Latino, and white). This procedure was chosen in preference to a sin¬ 
gle regression entering ethnicity as a nominal variable so that the rela- 
tionships would not he constrained to a single slope. The regressions 
used the entire NLSY sample, with exelusions as noted for specific 
analyses, applying 1990 sample weights. 

LOGISTIC REGRESSIONS 

All the indicators in Chapter 14 except for those involving income are 
binary variables, and the mode of analysis is logisrie regression. The In¬ 
terpretation of logistic regressions is discussed in Appendix 4. 

The data tables use short labels for the indicators. The full descrip- 
tion ot each indicator and associated characteristics of the analysis are 
shown in Tahle 1. 

Table 2 first summarizes the results, by ethnic group, for four sets of 
regressions: when age (zAge) is the only independent variable, when 
age and IQ (zAFQT) are independent variables, when age and parental 
SES (zSES) are independent, variables, and when all three are entered 
as independent variables. Three Basic questions are then examined: 

1. How much do ethnic differences change when IQ is taken into 
account? 

2. How much do ethnic differences change when parental SES is 
taken into account? 

3. What are the comparative roles of IQ and parental SES? 
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Because zAge, zAFQT, and zSES are all expressed as Standard scores 
with mean of zero and Standard deviation of 1, the intercept for the 
equation (abhreviated Jnt. in the tables) represents the expected value 
when those variables are set at their respective means. The coefficients 
for zAFQT and zSES are given so that you may examine the slopes as- 
sociated with them. 

The summary columns (Table 3) show the computed probabilities of 
the dependent variable when the independent variables are set at their 
means. 

Incorne Analyses 

Following the tables showing the logistic regressions, we present the 
detailed results of the ordinary least squares regressions used to estimate 
differences in income by ethnicity (Table 4). Because education is such an 
important causal factor in income, we show analyses in which years of ed¬ 
ucation (as of the 1990 interview) replaces IQ as an independent variable. 

The first set of models shows the parameters for wages of fulTtime, 
year-round workers by ethnic group. The sample for this analysis con- 
sisted of all persons in the NLSY who reported working for fifty-two 
weeks in 1989, had a reported wage greater than 0 (a handful of appar- 
ently self-employed persons who reported working fifty-two weeks re- 
ported no income), had an identified occupation, and had valid scores 
for IQ, parental SES, and educational level as of 1990. The second set 
of models shows the parameters for total family income from all sourc.es . 
The sample for this analysis includes all persons with valid scores on the 
independent variables, excluding only those who reported being out of 
the labor force in 1989 or 1990 because of enrollmenr in schon!. 

Table 5 shows the results when IQ, parental SES, and education are 
all entered as independent variables. Education is expressed as the high- 
est degree attained as of 1990 (no high school diploma, high school 
diploma, associate degree, bachelor’s degree, Professional degree). 

Table 6 shows the analysis of wages by ethnicity and occupational 
grouping based on the subject’s occupation in the 1990 interview (the 
variable labeled “00090”), using the 1970 U.S. Census Occupational 
Classification System. The Software used for these analyses, JMP 3.0, 
treats nominal variables differently from the convention in rnany other 
regression packages. See the introduction to Appendix 4 for details and 
an example. 



Table 2 Coefficients for Logistic Regression Analysis in Chapter 14 


Independent Controlling for Age Controlling for Age and IQ Controlling for Age and Parental SES 

Variables (zAge) (zAge, zAFQT) (zAge, zSES) 

White Black Latino White Black Larino White Black Latino 

Indicator Int. Int. Int. Int. IQ Int. IQ Int. IQ Int. SES Int. SES Int. SES 

Sample: NL5Y subjects 


High school dropout 

-2.271 

-1.598 

-1.080 

-2.943 

-1.995 

-3.676 

Bachelors degree 

-1.018 

-2.089 

-2.223 

-2.004 

2.127 

-1.078 

High'IQ occuparion 

-2.871 

-3.550 

-3.335 

-3.909 

1.532 

-2.997 

ln poverry 

-2.560 

-1.066 

-1.512 

-2.671 

-.957 

-2.128 

Unemployed 1 mo, (men) 

-2.714 

-1.352 

-1.819 

-2.127 

-.318 

-1.706 

Married by 30 

— 

— 

— 

1.336 

-193 

.311 

Ever on welfare (all women) 

-1.911 

-.053 

-.856 

-1.994 

-1.191 

-.853 

Bver on welfare (poor mothers) 

.487 

1.287 

.592 

.250 

-.387 

1.066 

Ever in jail (men) 

-3.697 

-1.895 

-2.800 

-3.917 

-1.067 

-3.015 

“Yes” on MCV index 

.026 

-1.362 

-.824 

-.061 

.727 

-.744 

Sample: Children n/ jVLSY mothers 

Born out of wedlock 

-2.008 

.504 

-1.190 

-2.153 

-.774 

.030 

Low birth weight 

-3.326 

-2.191 

-2.888 

-3.378 

-.387 

-2.668 

ln poverry Ist 3 yrs. 

-2.287 

.151 

-.856 

-2.693 

-1.608 

-1.799 

Ever in nonparental care 

-5.286 

-3.405 

—4-369 

-5.321 

-.454 

-3.548 

Wurst decile: HOME index 

-2.578 

-.960 

-1.317 

-2.816 

-.768 

-1.653 

Friendliness index 

-2.824 

-1.032 

-1.872 

-2.911 

-.707 

-1.745 

Difficuky index 

-2.508 

-1.272 

-2.115 

-2.484 

-.344 
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Motor & Social Dev. index 

-2.626 

-2.300 

-2.054 

-2.619 

-.426 

-3.100 

Behavioral Problems index 

-1.958 

-2.052 

-2.066 

-2.067 

-. 368 

-2.894 

Any developniental index 

-2.193 

-1.888 

-1.935 

-2.239 

-.401 

-2.620 

PPVT(IQ) 

-2.575 

.202 

.139 

-2.224 

-.968 

-.693 


-1.722 

-3.046 

-2.031 

-2.503 

-1.266 

-2.245 

-.734 

-1.949 

-.654 

1.943 

-1.927 

1.987 

-1.644 

1.524 

-1.665 

1.058 

-1.634 

.908 

1.705 

-3.206 

1.379 

-3.410 

1.036 

-3.373 

1.194 

-2.892 

.736 

-1.046 

-2.274 

-.898 

-2.607 

-.661 

-1.571 

-.623 

-2.201 

-.528 

-.315 

-2.079 

-.409 

-2.139 

-.189 

-1.648 

-.350 

-2.026 

-.213 

.122 

1.070 

-.116 

1.329 

-.238 

1.88 

t 

b 

er 

1.132 

.001 

-.902 

-1.737 

-1.060 

-1.922 

-.907 

-.494 

-.605 

-1.493 

-.528 

-.186 

.169 

-.314 

.442 

-.021 

1.363 

.083 

.223 

-.203 

-.954 

-3.421 

-.657 

-3.758 

-.727 

-2.154 

-314 

-3.131 

-.229 

.720 

-.220 

.962 

-.045 

.555 

-1.040 

.467 

-.442 

.355 


-.479 

-1.594 

-.359 

-2.103 

-.622 

.262 

-.278 

-1.506 

-.191 

-.428 

-3.004 

-.128 

-3.335 

-.168 

-2.214 

-.013 

-2.877 

.002 

-2.036 

-2.239 

-1.554 

-2.860 

-1.608 

-.862 

-1.189 

-1.901 

-.752 

-.165 

-4.257 

.048 

-5.258 

-.207 

-3.314 

.074 

—4-503 

-.143 

-.647 

-2.118 

-.703 

-2.747 

-.588 

-1.451 

-.523 

-1.867 

-.347 

-.713 

-3.062 

-1.168 

-2.882 

-.557 

-1.117 

-.118 

-2.418 

-.414 

-.664 

-3.249 

-1.110 

.000 

-.341 

-1.520 

-.336 

-2.748 

-.458 

-.756 

-2.540 

-.492 

-2.614 

-.366 

-2.597 

-.344 

-2.543 

-.348 

-.700 

-2.560 

-.412 

-2.059 

-.269 

-2.234 

-.191 

-2.360 

-.173 

-.651 

-2.471 

-.494 

-2.226 

-.326 

-2.055 

-.187 

-2.298 

-.244 

-.942 

-.851 

-1.260 

-2.603 

-.546 

-.148 

-.547 

-.187 

-.691 


Table 2 (Cont’d) Coefficients for Logistic Regression Analysis in Chapter 14 


Controlling for Age, IQ, and Parental SES 

Independent Variables (Independent variables: zAge, zAFQT, zSES) 


Indicator 

Int. 

White 

IQ 

SES 

Sample: NLSY subjects 

High school dropout 

-2.961 

-1.705 

-.626 

Bachelors degree 

-2.360 

1.854 

1.047 

HighdQ occuparion 

-4.065 

1.356 

.487 

In poverry 

-2.658 

-.818 

-.274 

Unemployed 1 mo. (men) 

-2.124 

-.305 

-.027 

Married by 30 

1.354 

-.102 

-.187 

Ever on welfare (all women) 

-1.983 

-.932 

-.516 

Ever on welfare (poor mothers) 

.258 

-.445 

.116 

Ever in jail (men) 

-3.933 

-.949 

-.234 

“Yes” on MCV index 

r / ~'. 

o 

r 

.597 

.274 

Sample: Children of NLSY mothers 

Born out of wedtock 

-2.178 

-.614 

-.344 

Low birth weighr 

-3.378 

-.406 

.039 

In poverry Ist 3 yrs. 

-3.009 

-1.141 

-1.198 

Ever in nonparental care 

-5.321 

-.471 

.034 

Worst Jecile: HOME index 

-2.657 

-.626 

-312 

Friendliness index 

-2.926 

-.552 

-.301 

Difficulry index 

-2.493 

-.223 

-.237 

Motor ist Social Dev, index 

-2.620 

-.323 

-.220 

Behavioral Problems index 

-2.101 

-.300 

-.159 

Anv developmental index 

-2.256 

-.310 

-.191 

PPVTUQ) 

-2.258 

-.875 

-.220 


Int. 

Black 

IQ 

SES 

Lnt, 

Latino 

IQ 

SES 

-3.930 

-1.612 

-.424 

-3.170 

-1.944 

-.160 

-.971 

1.718 

.611 

-1.797 

1.827 

.242 

-2.987 

1.382 

6.90 

-3.136 

1.287 

.133 

-2.279 

-.904 

-.368 

-2.489 

-.752 

-.273 

-1.809 

-.214 

-.281 

-2.119 

-.373 

-.060 

.282 

.167 

-.099 

1.126 

-.161 

.073 

-.967 

-.725 

-.377 

-1.930 

-.910 

-.266 

1.175 

-.235 

.146 

.000 

-.235 

-.169 

-3.034 

-.939 

-.048 

-3.388 

-.683 

.049 

-.672 

.614 

.248 

-.221 

.963 

-.002 


-.034 

-.4M 

-.149 

-1.661 

-.307 

-.086 

-2.608 

-.487 

.142 

-2.957 

-.168 

.064 

-2.342 

-1.725 

-.737 

-2.618 

-1.316 

-.420 

-3.489 

-.225 

.146 

-4.432 

.171 

-.203 

-1.870 

-.501 

-.392 

-2.238 

-.616 

-.149 

-1.717 

-.745 

.078 

-3.075 

-1.157 

-.017 

-1.998 

-.590 

-.188 

-3.326 

-1.046 

-.097 

-3.169 

-.700 

-.145 

-2.721 

-.352 

-.227 

-2.893 

-.700 

.000 

-2.597 

-.387 

-.044 

-2.618 

-.652 

.002 

-2.547 

-.436 

-.097 

-.845 

-.837 

-.374 

-.949 

-1.062 

-.344 




























Table 3 Expected Probabilities for Logistic Regression Analyses in Chapter 14 


When Age and IQ When Age and Parental When Age, IQ, and 
When Age Is Average Are Average SES Are Average Parental SES are Average 

(zAge = 0) (zAge = 0, zAFQT = 0)(zAge = o, zSES = 0) (zAge = 0, zAFQT = 0, zSES = 0) 


Indicator 

White 

Black 

Latino 

White 

Black 

Latino White 

Black Latino 

White 

Black 

Latino 

Sample: NLSY subjects 

High school dropout 

9.4% 

16.8% 

25.4% 

5.0% 

2.5% 

4.5% 

7.6% 

9.6% 

12.5% 

4.9% 

1.9% 

4.0% 

Bachelors degree 

26.5 

11.0 

9.8 

11.9 

25.4 

12.7 

16.2 

15.9 

16.3 

8.6 

27-5 

14.2 

HighdQ occupation 

5.4 

2.8 

3.4 

2.0 

4.8 

3.9 

3.2 

3.3 

5.3 

1.7 

4-8 

4-2 

ln poverty 

7.2 

25.6 

18.1 

6.5 

10.6 

9.3 

6.9 

17.2 

10.0 

6.5 

9.3 

7.7 

Unemploycd 1 mo. (men) 

10.2 

20.6 

14.0 

10.7 

15.4 

11.1 

10.5 

16.1 

11.7 

10.7 

14.1 

10.7 

Married by 30 

77.7 

54.2 

76.1 

79.2 

57.7 

74.5 

79.1 

54.7 

75.6 

79.5 

57.0 

75.5 

Ever on welfare (all women) 

12.9 

48.7 

29.8 

12.0 

29.9 

15.0 

12.8 

37.9 

18.3 

12.1 

27-5 

12.7 

Ever on welfare (poor mothers) 

61.9 

78.4 

64-4 

56.2 

74.4 

54.2 

60.9 

79.6 

55.6 

56.4 

76.4 

50.0 

Ever in jail (men) 

2.4 

13.1 

5.7 

2.0 

4.7 

3.2 

2.3 

10.4 

4.2 

1.9 

4.6 

3.3 

“Yes” on MCV index 

50.6 

20.4 

30.5 

48.5 

32.2 

44-5 

48.9 

26.1 

39.1 

47.9 

33.8 

44.5 

Sample: Children ofNLSY mothers 

Bom out of wedlock 

11.8 

62.3 

23.3 

10.4 

50.7 

16.9 

10.9 

56.5 

18.2 

10.2 

49.2 

16.0 

Low birth weight 

3.5 

10.1 

5.3 

3.3 

6.5 

4.7 

3.4 

9.9 

5.3 

3.3 

6.9 

4.9 

In poverty Ist 3 yrs. 

9.2 

53.8 

29.8 

6.3 

14-2 

9.6 

5.4 

29.7 

13.0 

4-7 

10.5 

6.8 

Ever in nonparental care 

0.5 

3.2 

1.3 

0.5 

2.8 

1.4 

0.5 

3.5 

1.1 

0.5 

3.0 

1.2 

Worst decile: HOME index 

7.1 

27.7 

21.1 

5.6 

16.1 

10.7 

6.0 

19.0 

13.4 

5.4 

13.4 

9.6 

Friendliness index 

5.6 

26.3 

13.3 

5.2 

14-9 

4-5 

5.3 

24.7 

8.2 

5.1 

15.2 

4.4 

DifFiculry index 

7.5 

21.9 

10.8 

7.7 

12.8 

3.7 

50.0 

17.9 

6.0 

7.6 

11.9 

3.5 

Motor (St Social Dev. index 

6.7 

9.1 

11.4 

6.8 

4.3 

7-3 

6.8 

6.9 

7.3 

6.8 

4-0 

6.2 

Behavioral Problems index 

12.4 

11.4 

11.2 

11.2 

5.2 

7.2 

11.5 

9.7 

8.6 

10.9 

5.3 

6.9 

Any Jevelopmenral index 

10.0 

13.1 

12.6 

9.6 

6.8 

7.8 

9.7 

11.4 

9.1 

9.5 

6.8 

7.3 

PPVT (IQ) 

7.1 

55.0 

53.5 

9.8 

33.3 

30.3 

6.9 

46.3 

45.3 

9.5 

30.0 

27.9 


Table 4 Income Analyses in Chapter 14 (in 1990 dollars) 


Model I 
(zAge) 


White 

Black 

Latino 

White 

Black 

Latino 

White 

: Annual wages for full-time year-round workers , 

1989 


27,372 

20,994 

23,409 

25,546 

25,001 

25,159 

25,329 

2,814 

1,354 

2,163 

1,968 

861 

1,896 

2,057 




5,660 

5,454 

4,018 

4,850 







1,753 


Independent 
Variables 

Dependent van 
Jntercept 
Age 

IQ 

Parental SES 
Education as of 1990 


Dependent variable: Total family income, 1989" 
Intercept 41,558 29,880 35,514 

Age 3,326 1,576 3,628 

IQ 

Parental SES 
Education as of 1990 

Minoriry income as a percentage of white income 

Annual wages 76-7% 85.5% 

Family income 71.9% 85.5% 


Model II 
(zAge, zAFQT) 


Model 111 

(zAge, zAFQT, zSES) 
Black Latino 


25,339 

949 

4,875 

1,420 


26,002 

2,144 

3,162 

1,352 


39,225 

2,049 

8,332 


36,432 

39,689 

38,623 

37,723 

41,051 

805 

2,709 

2,354 

888 

3,010 

8,590 

8,870 

5,936 

5,804 

7,488 



5,097 

5,817 

2,223 

97-9% 

98.5% 


100.0% 

102.7' 

92.9% 

101.2% 


97.7% 

106.3' 


Model IV 
(zAge, zEduc90) 
White Black Latino 


26,292 

2,636 


40,194 

2,921 


* For persons not out oflabor force because of school in 1989 or 1990. 


20,962 

1,333 


24,092 

2,188 


5,395 4,794 2,755 


30,511 

1,415 


37,565 

3,553 


8,313 7,933 7,764 


79.7% 

75.9% 


Model V 


91.6% 

93.5% 


(zAge, 

zEduc90, zSES) 

White 

Black 

Latino 

26,048 

21,841 

25,687 

2,632 

1,377 

2,463 

1,341 

1,626 

2,000 

4,790 

4,140 

1,934 

39,453 

33,841 

40,009 

2,963 

1,294 

3,788 

4,217 

6,022 

3,109 

6,394 

5,346 

6,360 


83.8% 

98.6% 


85.8% 

101.4% 
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Table 5 Income Analyses in Chapter 14 (in 1990 dollars), by De- 
gree Attained 


Dependent Variable: 

Armani Wägers for FulFTime, 




Vear-Round Workers, 1 989 

Total Fatnily Income 11 

Independent Variables 

White 

Black Latino 

White 

Black Latino 

[ntcrcept 

26,994 

27,048 26,474 

40,813 

38,050 41,271 

Age 

2,338 

787 2,207 

2,583 

946 3,091 

IQ 

3,082 

3,802 2,507 

3,025 

4,247 4,1 36 

Parental SES 

914 

840 1,248 

3,648 

5,191 2,042 

Highest degrea attained 





Less than high schon] 

-4,992 

-3,688 -1,588 

-9,743 

-4,181 -9,461 

UED 

-2,622 

-3,950 -3,039 

-5,202 

-4,159 -7,68 3 

High school diploma 

-2,602 

-3,944 -1,151 

-2,789 

-2,817 -1,269 

Bachelors degree 

3,329 

734 2,938 

4,286 

4,362 10,506 

Graduate degree 

6,887 

10,848 2,840 

13,448 

6,795 7,907 

Minoritv income os u 





percentagc of white income 

100.2% 9 8.1% 


93.2% 10 L 1 % 

1 For persons not out ot lahor force bec 

hu sc of school in 1989 

or 1990. 


Sample sizes for the 

different occupations analyzed in Table 6 below 

are as follow: 







White 

Black 

Latino 

Professional/technk 

:al 

605 

143 

129 

Managers/ad min istrators 

462 

110 

103 

Clerical workers 


473 

260 

172 

Sales workers 


163 

34 

30 

Craft and kindred u 

/orkers 

370 

113 

106 

Transport operatives 

95 

55 

40 

Other operatives 


231 

143 

67 

Service workers 


289 

218 

95 

Unskilled laborers 


98 

78 

40 

Farmworkers 


22 

4 

12 


Because of rhe small numbers of farmworkers, thar category is omitted 
front the table. Note, however, that farmworkers were included in the 
aetual regression equation; hence the eoefficients for the nominal oc- 
cupation categories will not sum to zero. 


Table 6 Income Analyses in Chapter 14 (in 1990 dollars) 



-killeJ hhnrer- 

















Appendix 7 


The Evolution of Affirmative 
Action in the Workplace 


Much of the current debate about affirmative action in employment 
takes place in ignorance of the original objectives of affirmative action 
and the vvays in which antidiscrimination law has evolved. Because we 
believe that retuming to the original intention of affirmative action is 
a key to progress in social policy on many fronts and because our rec- 
ommendation seems so radical in the prevailing eontext, this appendix 
presents a full discussion of the nature of the original objectives and the 
evolution as it pertains specifically to employment tests. 

Affirmative action in the workplace, as distinguished from the 
broader and older civil rights movement, Starts with Title Vll of the 
Civil Rights Act of 1964. Title VII laid down principles of fair employ¬ 
ment practice as regards race, religion, national origin, and sex, and ir 
created the Equal Employment Opportunities Commission (EEOC) to 
administer and promote them. Besides Title VII (as amended over the 
years), affirmative action in the workplace comprises subsequent acts of 
Congress (and state legislatures), presidential executive Orders, rulings 
by the EEOC and other branches of govemment, and landmark court 
cases. The basic intent of all of this energetic policymaking has been to 
make workplaces fairer to people from oppressed or mistreated groups. 

As desirable as that goal may seem to just about anyone, a clear no- 
tion either of what it means or how to accomplish it does not emerge 
from the documents of this enormous (and spreading) battleground of 
law, regulation, litigation, and commentary. The good news is that many 
issues of fair employment practice need not concern us hered But we 
cannot avoid looking at how Title VII (and its elaborations) dealt with 
the use of ability tests in the selection or promotion of employees. Al- 
though the tests are given to individuals, the groups of which they are 
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a parr may average high or low compared to the popularion as a whole; 
as it happens, some of the groups who average low are protected trom 
“unfair employment practices” by Title VII. Hence the government gets 
into the business of regulating employment testing. 

The ramifications of even the narrow issue of employment test reg- 
ulation havc ranged so far and wide that employment testing has be- 
come a new specialty in the creation and practice of law and in 
government regulation undreamed of by the Founders. Thousands, per- 
haps millions, of legislative and bureaucratic man hours have heen lav- 
ished on it. Thousands of cases have heen argued in courtT 1 Douhtless 
many more cases have not been argued, as the specter of legal action 
has shaped innumerable decisions in corporate offices and boardrooms. 
The stance of the government and the courts has increasingly been to 
distrust tests that produce group differences, as if they presume that, in 
the absence of illegal discrimination, the groups should be equal. 

THE EVOLUTION OF TITLE VII 

Title VII of the 1964 Act specifically did not prohibit the use of em¬ 
ployment tests, provided that the tests were not “designed, intended or 
used” to discriminate against people because of their race, color, reli- 
gton, sex, or national origin. It said nothing about group differences, ab 
rhough it was clear in 1964 that ability tests would result in 
disproportionately fewer high scores for at least some of the groups of 
people protected from discrimination by the act. Some of the act s pro- 
ponents believed that some of the group differences in test scores were 
heuig used as a pretext for unfair discrimination; for that reason the act 
included a proviso regarding the tests. The hope was that Title VII would 
promptly eradicate this unfair use of tests. It was left to the EEOC to 
come up with the means of doing so. 

ln 1966, the EEOC formulated the first of a series of guidelines. An 
employment test, it ruled, had to have a proven power to measure a per- 
son’s “ability to perform a particular job or dass of jobs.’^ It was not 
enough, said the guideline, that the test be drawn up by Professional 
testers; ir also had to have some practical import—some “joh related- 
nessd in the evolving jargon of the field. Why this particular guideline? 
The answer is rhat staff for the newly launched EEOC had quickly be- 
come convinced that some employers were, as anticipated, hiding be- 


hind the credentials of Professional testers to use ability tests that had 
little bearing on job performance, and that they were doing so to dis- 
criminate against blacks. 4 The guideline was an attempt to pierce the 
veneer of Professional respectability and thereby correct this Violation 
of law and principle, as the EEOC saw it. 

The criterion of job relatedness did not resolve the uneasiness about 
testing for the EEOC. Ability testing for employment had, after all, be- 
come an issue under Title Vll because various groups of people get dif¬ 
ferent average scores. This was the heart of the matter, and new 
guidelines laid down in 1970 addressed it frontally. For the first time, 
EEOC guidelines mentioned the issue of disproportionate success ofdif 
ferent groups on any given test. 1 When a test “adversely affects’ (more 
jargon, along with “disparate impact” or “adverse impact ) members of 
a protected group, said the new guidelines, it had to be shown not only 
that the test really did predict joh performance but that the prediction 
was strong enough to make a significant economic difference and that 
no nondiscriminatory alternative was available. An employer, the rea- 
soning vvent, may have abandoned older and cruder forms ofdeliberately 
discriminatory treatment of workers or job applicants (often called dis- 
parate treatment”) but still he violating the intent of the law by using a 
needlessly discriminatory test. Disparate impact, in other words, was to 
be the red flag that set the EEOC in motion. 

GRIGGS AND AFTERWARD 

Soon after, the U.S. Supreme Court entered the fray. Applicants for cer- 
tain desirahle johs at the Duke Power Company had been required to 
have a high school diploma or to earn ability test scores ahove a cutoff. 
Fewer blacks were getting over these hurdles than whites; a suit found 
its way to the Supreme Court. The Court’s decision in Griggs v. Duke 
Power Co. , 6 was instantly recognized as a tuming point in the march of 
affirmative action in the workplace. 7 The Supreme Court struck down 
the use of either the tests or the educational requirement, because the 
Company was unable to satisfy the Court that either a diploma or a high 
score on a test had any bearing on the johs the applicants were being 
hired for. 8 

Duke Power Co.’s defense was, among other things, that it was try- 
ing to raise the general intellectual level of its work force by imposing 
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educational or ability testscore requirements. In the Courts unanimous 
decision (which reversed contrary opinions in both the federal district 
and Circuit courts), Chief Justice Warren Burger approved unstintingly 
of the EEOC’s guidelines: Adverse impact placed a bürden of proofon 
employers to show not just that they were not intentionally discrimi- 
nattng against the protected groups but that their testing procedures 
could be justified economically, and that no other available hiring pro- 
cedure is equally useful but less discriminatory. Said the Court, good 
(i.e., nondiscriminatory) intentions do not excuse tests “that operate as 
‘built-in headwinds’ for minority groups and are unrelated to measuring 
job capability.“ g There must be both “business necessity“ and a “mani¬ 
fest relationship” between the test and the job, as the EEOC had ruled. 
Employers were being told to be wary of off-the-shelf tests of general 
ability; if they wanted to use a test at all, they would be well advised ro 
write them for the specific job at hand and to do their own Validation 
studies. 

Ordinarily there is some presumption that people will obey guide- 
lines proposed by a federal agency like EEOC, but not doing so does not 
violate the law. lndeed, in the legislative record, Congress was assured 
that the EEOC had no enforcement powers. However, the Court in 
Criggs said that the EEOC guidelines deserve “great deference,“ 10 which 
endowed them with authority verging on the power of law itself. This 
laying on of the hands of legality is one reason that Griggs has become 
the landmark case it has tumed out to be, for only a defiant or reck less 
employer would disregard guidelines that the Court embraced so en- 
thusiastically. Beyond that, however, Griggs transformed the very con- 
ception of affirmative action in the workplace. 

The Court grounded its decision in the 1964 Civil Rights Act itself, 
although the act said nothing about job relatedness, adverse impact, or 
the lack of alternative hiring criteria. The act did, however, say that a 
test must not be “designed, intended or used” to discriminate. against 
people in the protected minority groups. Like the EEOC, the Court con- 
sidered job relatedness and adverse impact to be reasonable translations 
of Title VII’s principles into practice. But it can be argued that job re¬ 
latedness and disparate impact per se go well beyond Title Vll, because 
a test may have disparate impact and not be specifically related to the 
particular job being filled without the employers having designed, in¬ 
tended, or used it for discriminatory purposes. fll! 


The issue hinges on whether each of the three terms—“designed, in¬ 
tended or used“—must signify discriminatory intent (i.e., the guilty 
mind usually required in cases of liability) or only the first two. The first 
two terms—“designed, intended“—clearly imply discriminatory intent. 
Must the third? No, said the Supreme Court, “used“ need not. And if it 
need not, then an employer is violating Title VII even if he is not guilty 
of discriminatory intent, so long as the test has disparate impact and has 
not been proved, to the Courts satisfaction, to be job related. 1121 

After two decades in force, the Court s Interpretation may seem cor- 
rect to many readers, but both the legislative record and the wording of 
Title VII belie it. 1151 Proponents of Title VII, on the floor of Congress 
and elsewhere, repeatedly assured the Opposition that tests administered 
without discriminatory intent, however adverse their effects, were not 
being challenged, let alone banned. 1141 For example, in a memoranclum 
submil red by Senator Clifford Case, one of Title VI Fs leading advocates 
during the legislative debates, we find the following assurance: “No 
court could read Title VII as requiring an employer to lower or change 
the occupational qualifications he sets for his employees simply because 
fewer Negroes than whites are able to meet them.” h Senator Hubert 
Humphrey, as we noted in Chapter 20, also assured fellow legislators 
that Title VII would never be used to impose percentage hiring re¬ 
quirements (disparate impact criteria) on employers. 

A year later, in the Equal Employment Opportunity Act of 1972, 
Congress spoke for the third brauch of govemmcnt, allying itself with 
the Court and the EEOC. It disapproved of mere “‘papeP credentials” 
(such as cognitive ability test scores) that are of “questionable value.“ 
It warned that such credentials burdened people who were “socioeco- 
nomically or educationally disadvantaged“ with “artificial qualifica- 
tions.” 10 When it first enacted Title VII in 1964, Congress on the whole 
trusted general ability tests to serve the purpose of predicting worker 
quality; by 1972, Congress, echoing Griggs, had become far more skep- 
tical of the predictive power of those tests and suspicious that they were 
a pretext for illegal discrimination. 17 In the words of one legal scholar, 
“The central rationale of the Court’s decision in Griggs . . . was based 
«on an assumption that those of different races are inherently equal in 
ability and intelligence, and on a deep skepticism about the utility of 
devices traditionally used to select among applicants for employment.“ 18 

With all three branches of government pushing in the same general 
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direction, affirmative action policies evolved toward grcater relianee on 
disparate impact as the touchstone of illegality rather than on discrim- 
inatory intent or disparate treatment. As in Griggs, the Supreme Court 
in 1975, in Albemarie Paper Co. v. Moody, 19 considered a case in which 
an employer used intelligence tests (among other criteria) to sclect 
workers for well-paying jobs. Once again, black applicants, who earned 
lower scores than white applicants, brought suit. 20 The Court reaft'irmed 
the general outlines of Griggs, but in fillingout details, it provided three 
Steps to follow in proving that an employment test was in violation of 
Title VII (as amended). First, the Court said, a complaining parry must 
show disparate impact. This involved a Statistical proof that those who 
were hired or promoted on the basis of the test included significantly 
fewer members of a protected group than random selection from the ap- 
plicant pool would have produced. Given this proof of disparate impact, 
the bürden of proof shifts to the employer, who must now prove that 
scores on the test have a proven and vital relationship to the specific job 
they were hired for. The criterion expressed in Griggs, “business neces¬ 
sity,” was carriedforward into Albemarie. If the employer passes this hur- 
die, the complaining party can offer evidence that the employer coukl 
have used a different hiring procedure, one that was as effective in se- 
lecting workers but without the disparate impact. If this can be shown, 
then, the Court ruled, the employer has been shown to have discritni- 
nated itlegally by failing to have used the alternative procedure U!| 

Other federal authorities besides the EEOC were monitoring and pro- 
moting affirmative action in the workplace. In the mid-1970s, as in- 
consistencies began to crop up, pressure built up for coordinating as 
broad a slice of the federal involvement in affirmative action as possi- 
ble. After some false Starts, the Uniform Guidelines on Employee Se- 
lection Procedures were adopted in 1978 by EEOC, the Civil Service 
Commission (later called the Office of Personnel Management), the 
Department of Justice, the Department of the Treasury, and the De¬ 
partment of Labor.“ At this writing, they are still in force. The Court’s 
decisions in Griggs and Albemark set the broader framework for the Uni¬ 
form Guidelines, but further details were elaborated, in some respeets 
increasing the pressure on employers using tests. For example, the Uni¬ 
form Guidelines held—in contrast to the Court in Albemarie —that the 
employer has a responsibility for seeking less discriminatory selection 
procedures, a rather different matter from giving a complaining parry 
the opportunity to do so, as the Court had decreed. 


VALIDATING EMPLOYMENT TESTS 

The Uniform Guidelines attempt to define a unified approach to affir¬ 
mative action in the workplace, but practices still vary, and there con- 
tinue to he new laws and new interpretations by courts. But they come 
as close to a policy Consensus as anything does. They also reveal the un- 
derlying assumptions ahout the facts. On the matter of test validation, 
the Guidelines espouse the stringent “business necessity” requirement 
held in Griggs and Albemarie. They provide detailed requirements for 
validating tests. Without submerging our readers more deeply in tech- 
nical minuriae than seems appropriate here, let us say that the Uniform 
Guidelines lean sharply toward criteria that would he hard and expen- 
sive for employers to meet, even when cheaper or easier methods almost 
certainly would have been more effective. 25 General ahility tests, read- 
ily available and widely standardized, are rarely acceptable to the EEOC 
or rhe courts, unless the employer goes through the difticult, if not 
impossible, and, psychometrically speaking, needless, process of re- 
Standardization of an established test. To validate a test, an employer 
needs a measure of performance. The government typically rejects mea- 
sures of training performance and Supervisor ra rings. As Chapter 3 
detailed, both training scores and Supervisor ratings may he suitable 
measures of performance, and they are relatively easy to obtain. The 
measures usually required by the government are all but impossible to 
obtain, especially for job candidares who are not hired. 

Despite an air of rigor and precision in discussing validation, neither 
the EEOC nor any other branch of government involved in administer- 
ing affirmative action policies has shown any interest in evaluating just 
how predictive of worker performance the stringent and costly valida¬ 
tion procedures it demands are, or whether there is any gain in predic- 
tive power when they are used. The thrust continues to he, as it has been 
from the heginning, to increase the numbers hired or promoted from the 
protected groups, based on the underlying assumpr.ion that, except for 
discrimination or the legacy of past discrimination, the protected groups 
should be equally represented across the occupational spectrum. 

DISPARATE IMPACT 

According to the Guidelines, an employer that eomes under their ju- 
risdiction can expect to he required to validate a test—that is, to prove 
its business necessity—if there is disparate impact. And, the Guidelines 
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further say, disparate impact is assumed ifselectingemployees by the test 
violates the 80 percent rule, explained in Chapter 20. As helpful as it 
may be to employers and regulators to have a fixed Standard for disparate 
impact, the 80 percent rule is psychometrically unsound because it sets 
a fixed Standard. Given two groups with differing average scores and a 
cutoff for hiringor promotion, the ratio of those selected from the lower 
group to those selected from the higher group, given a fair hiring process , 
shrinks as the cutoff rises. 

Suppose that you are an employer faced with two groups that are of 
equal size in the applicant pool. The higher group averages one Stan¬ 
dard deviation above the lower on an IQ test, but the distribution of 
scores for each group is normal and has the same variability. The eighty 
percent rule fixes the ratio at eighty hired from the lower group (if it is 
protected by affirmative action) per hundred hired from the higher 
group. But if you want to establish a minimum IQ of 100 as the cutoff 
point for hiring workers, only slightly more than thirty applicants from 
the lower group would be selected for every hundred from the higher. 
Suppose that you need a work force with above-average IQs, so you raise 
the cutoff to an IQ score of 110. In that case, a fair hiring process could 
be expected to select only twenty of the lower group for each hundred 
selected from the upper group. If you need a work force with a minimum 
IQ of 120, the ratio drops to about ten from the lower per hundred from 
the higher. The ratio will continue to shrink indefinitely as the cutoff 
moves upward. In other words, applying the 80 percent rule has drasti- 
cally different effects for an employer hiring people for janitorial jobs 
compared to an employer hiring lawyers or accountants. Even if one is 
in favor of the concept of avoiding “disparate impact,” the 80 percent 
rule is an extreme ly unrealistic way of doing so. 


to show that the test serves legitimate Business goals. It looked as if the 
Duke Power Co. s defense in Griggs—to improve the general intellec- 
tual quality of its employees—would have inet this new Standard. Soon 
thereafter, however, Congress retaliated. The Civil Rights Act of 1991 
repudiated Words Cawc and returned to the Standards of Griggs and Albe- 
rrnrle —to business necessity, job relatedness, and disparate impact as 
those earlier dccisions had defined it. Once again, employers evidently 
must satisfy a criterion for employment testing that the Court, two years 
hefore, judged to be impossibly demanding. The new law is fraught with 
ambiguity and will doubtless send lawyers, their clients, and courts back 
to work to figure out what it requiresC But the best guess is that the 
trendline had blipped, not reversed. 


A REVERSAL IN THE AFFIRMATIVE ACTION TREND LINE, OR 
A BLIP? 

The Supreme Court in 1989 backed off from its most demanding re- 
quirements for employment testing. ln Wards C ave Pöcking Cu., Inc. v. 
Atonio / 4 it softened the Obligation on the employer in justifying dis¬ 
parate impact of a test. “Business necessity,” the Court said, is an un- 
reasonably stringent criterion, virtually impossible for most ordinary 
businesses to meet. The result of so extreme a requirement, warned the 
Court, would be “a host of evils.” 2 ’ It was, the Court now said, enough 
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CM/ner J 

1. Bender 1960, p. 2. 

2. The national SAT-V in 1952 was 476, a little more than a Standard devi- 
ation lower than the Harvard mean. Perhaps the average Harvard Student 
was much farther ahead of the national average than the tcxt suggests be- 
cause the national SAT-taking population was so selective, representing 
only 6.8 percent of high school graduates. But one of the oddities of the 
1950s, discussed in more detail in Chapter 18, is that the SAT means re- 
mained constant through rhe decade and into 1963, even as the size of the 
test-taking population mushroomed. By 1963, when SAT scores hit rheir 
all-rime high in rhe post-1952 period, the test-taking population had 
grown to 47-9 percent of all high school graduates. Thus there is reason to 
think that rhe comparison is about the same as the one that would have 
heen produced hy a much larger number of test takers in 1952, 

3. Bender 1960, p. 4. 

4. ln the 1920s, fewer than 30 percent of all young people graduated from 
high school, and the differences hetween the cognitive ability of graduates 
and nongraduates were small, as discussed in Chapter 6. Something he- 
rween 60 and 75 percent of the 18-year-olds in the top IQ quartile never 
even made it into the calculations shown in the figure on page 34- From 
the early 1960s on, 70 percent of the nation’s yourh have graduated from 
high school, and we know that the difference hetween the ability of those 
who do and do not graduate has been large. More concretely, of a nation- 
ally representarive sample of yourh who were adminisrered a highly re- 
garded psychometric test in 1980 when they were 15 and 16 years old, 95 
percent of those who scored in the top quartile suhsequently graduated 
from high school, and another 4 percent eventually got a general equiva- 
lency diploma. The test was the Armed Forces Qualification Test, and the 
sample was the 1964 birth cohort of the National Longitudinal Survey of 
Youth (NLSY), discussed in detail in the introduction to Part II. The fig¬ 
ure for the proportion entering Colleges is based on the NLSY cohorts and 
srudents entering Colleges over 1981-1983. 

5. The top IQ quartile of the NLSY that first attended College in 1981-1983 
was split as follows: 21 percent did not continue to College in the first year 
after graduation, 18 percent went to a two-year College, and 61 percent at- 
tended a four-year College. 

6. O’Brien 1928. These percenrages are based on high school graduates, 
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which accounts for the high percentages of srudents shown as going to Col¬ 
lege in the 1920s. It the estimates had been hased on rhe proportion of the 
18-year-olds who have been graduating front high school since the 1970s, 
those proportions would have been much smaller. The shape of the curve, 
however, would be essentially unchanged (because the IQ distribution of 
students who did not complete high school was so dose to the distribution 
ofthose who did; see Finch 1946). 

7. Another excellent database from the same period, a nationally represen- 
tative sample tested with the Preliminary SAT in 1960 and followed up a 
year later, confirms results from Project TALENT, a large, nationally rep- 
resentative sample of high school youths taken in 1960 (Seihet 1962). 
Among those who scored in the bottom quartile, for example, only 11 per- 
cent went to College; of those in rhe top quartile, 79 percent went to Col¬ 
lege; of those in the top 5 percent, more than 95 percent went to College. 

8. These data are taken from Project TALENT in i960. 

9. From the NLSY, described in the introduction ro Part IL 

10. The test was Form A of rhe Otis. Brigham 1932, Table XVlll, p. 3 36. 

11. The schools are Brown, Bryn Mawr, Columbia, Harvard, Mount Holyoke, 
Princeton, Radeliffe, Smith, University of Pennsylvania (with separate 
means for men and women), Vassar, Wellesley, Williams, and Yale. 

12. Learned and Wood 1938. 

13. Not including the University of Pennsylvania, one of the elite schools. 

14. Between the earliest SAT and 1964, the SAT had divided into a verbal 
and a math score. It is a moot question whether the modern overall SAT 
or the verbal SAT is more comparable to the original SAT In the com- 
parisons being made here, we rely on the Educational Testing Service norm 
studies, which enable us to place an SAT value on the tuuiotuiI 18-year-old 
cohort, not just the cohort who takes the test. We explain the norm stud¬ 
ies in Chapter 18. 

15. This is not the usual SAT distribution, which is ordinarily restricted to 
college-bound seniors, but rather shows the distribution for a nationally 
represenrative sample of all high school seniors, hased on the norm stud¬ 
ies mentioned in note 14. It is restricted to persons still in high school and 
does not include the 34 percent of 18-year-olds who were not. 

16. We know how high the scores were for many schools as of the early 1960s. 
We know Harvard’s scores in the early 1950s. We can further he confident 
that no school was much more selective than Harvard as of 1952 (with the 
possible exception of Science students going to Cal Tech and MIT). Therc- 
fore means for virtually all of the other schools as of 1952 had to he near 
or below Harvard’s, and the dramatic changes for the other elite schools 
had to he occurring in the same comparatively brief period of time coiv 
centrated in the 1950s, 


17. Bender 1960, p. 6. 

18. This porcentage is derived from 1960 data reported by Bender 1960, p. 15, 
regarding the median family i neu me of candida tes who applied for schol¬ 
arship aid, were denied, but came to Harvard anyway. Total costs at Har¬ 
vard in 1960 represented 21 percent of that median. 

19. The families for whom a year at Harvard represented less than 20 percent 
of rheir income constituted approximately 5.8 percent of families in 1950 
and 5.5 percent of families in 1950. Estimated from U.S. Bureau of the 
Census 1975,G-1-15. 

20. The faeulty’s views were expressed in Faculty of Arts and Sciences I960. 

21. Bender 1960, p. 31. 

22. For an analysis of rhe ascriptive qualities that Harvard continued to use for 
admissions choices in the 1980s, see Karen 1991. 

2 3. The increase in applications to Harvard had heen just as rapid from 1952 
to 1958, when the size of the hirth cohorts was virtually constant, as in 
1959 and 1960, when they srarted to increase. 

24. For an analysis of forces driving more recent increases in applications, see 
Clotfelter 1990 and Cook and Frank 1992. 

25. Cook and Frank 1992. 

26. Harvard, MIT, Princeton, Stanford, and Cal Tech were in the top seven 
in all three deeades. Columbia and Chicago were the other two in the 
1960s, Yale and Cornell in the 1970s and 1980s. Cook and Frank 1992, 
Table 3. 

27. Cook and Frank 1992, Table 4. The list of “most competitive” consists of 
the thirty-three schools named by BarrorPs in its 1980 list. The Cook and 
Frank analysis generally suggests that the concentration of top studenrs in 
a few schools may have plateaued during the 1970s, then resumed again in 
rhe 1980s. 

28. ITS. News & World Report, October 15, 1990, pp. 116-134. It is not nec- 
essary to insist. that this ranking is precisely aceurate. Ir is enough that it 
includes all the schools that most people would name if they were asked 
to list the nation’s top schools, and the method for arriving ar the list of 
fifty seems reasonable. 

29. The College Board ethnic and race breakdowns for 1991, availahle hy re- 
q uest from the College Board. The re is also reason to helieve that an ex - 
tremely high proportion of high school students in each senior dass who 
have the potential to score in the high 600s and the 700s on the SAT ac- 
tually take the test. See Murray and Herrnstein 1992. 

30. See Chapter 18 for where the SAT population resides in the national eon¬ 
text. 

31. These represent normal distributions hased on estimates drawn from the 
Learned data that the mean IQ of Pennsylvania graduates in 1930 was ap- 
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proximately two-thirds of a Standard deviation above rhe mean (the mean 
of incoming freshmen was .48 SDs above the mean), and from the Rrigham 
data that the graduates of the Ivy League and Seven Sisters were approxi- 
mately 1.25 SDs above the mean (they were 1.1 SDs above rhe mean as 
freshmen, and the Ivy League graduated extreme ly high proportions of the 
incoming students). 

32. The distrihutions for rhe main groups are based on the NLSY, for youths 
who ca me of College age from 1981 to 1983 and havebeen followed through 
the 1990 interview wave. The top dozen universities are those ranked 1 
through 12 in the U.S. News & World Report survey for 1990. US. News 
& World Report , October 15, 1990, pp. 116-134. The analysis is based on 
published distribution of SAT Verbal scores, which is the more highly g- 
loaded of the SAT suhtests. The estimated verbal mean (weighted by size 
of the freshman dass) for these twenty schools, based on their published 
SAT distrihutions, is 633. The estimated mean for graduates is 650 
(dropout rates for rhese schools are comparatively low but highly con - 
centrated among those with the lowest entering scores). This compares 
with a national SAT Verbal norm estimated at 376 with an SD of 102 
(Braun, Centra, and King, 1987, Appendix B). The distribution in the fig- 
ure on page 46 converts the SAT data ro srandardized scores. The implicit 
assumption is that AFQT (Armed Forces QualiHcatlon Test, an intelli- 
gence test discussed in Appendix 3) and SAT Verbal measure rhe same 
rhing, which is surely wrong to some degree. Both tests are highly g-loaded, 
however, and ir is reasonable to conclude that youths who have a mean 
2.5 SDs above the mean on the SAT would have means somewhere close 
ro that on a fulhfledged mental test. 

55. We have defined these as the first twelve of the listed universities in the 
US. News & World Report listing for 1990. They are (in the Order of their 
ranking) Harvard, Stanford, Yale, Princeron, Cal Tech, MIT, Duke, Darr- 
mouth, Cornell, Columbia, University of Chicago, and Brown. 

34. The probabilities are based on rhe proporrions of people entering rhese 
categories in the 1980s, which means that they become progressively too 
generons for c)leier readers (when the proportion of people getting College 
degrees was smaller). Bur this is a technicality; the odds are already so tiny 
that they are for practical purposes unaffected by further restrictions. The 
figure for College degrees reflects the final educational attainnient of mem- 
bers of the NLSY, who were bom in 1957 through 1964, as of 1990 (when 
the youngest was 25), as a weighted proportion of the NLSY population. 
The figure for Ph.D., law, and medical degrees is based on the number of 
degrees awarded over 1980—1989 expressed as a proportion of the popula¬ 
tion age 26 in each of those years. The figure for graduates of the dozen 
elite schools is based on the number of undergraduate degrees awarded by 


these Institution* in 1989 (the figure Was varied little for many years), ex¬ 
pressed as a proportion of rhe popularion age 22 in 1989 (ineidentally, the 
sinallest cohorr since the mid- 1970s.) 

55. Based on the median percentages for those score inrervals among those 
schools. 

CAuptcr 2 

1. Herrnstein 1975. 

2. For a one-source diseussion of IQs and occupations, see Matarazzo 1972, 
Chap. 7. Alst) see Jencks er al. 1972 and Sewell and Hauser 1975 for com- 
prehensive analyses of particular sets of data. The literature is large and ex- 
rends hack to the early part of the Century. For earlier studies, see, for 
example, Ringham 1957; Clark and üisr 1938; Fryer 1922; Pond 1953; 
Stewart 1947; Terman 1942. For more recent estimates t)f minimum scores 
for a wide variety oi occupations, see E. F. Wonderlic 6c Associates 1983; 
U.S. Department of Labor 1970. 

5. Jencks et al. 1972. 

4- Fallows 1985. 

5. The Fels Longitudinal Study; see McCali 1977. 

6. The correlation was a sizable .5-.6, on a scale that goes from —l to +1. See 
Chapter 5 and Appendix 1 for a füllet* explanation of whar the correlarion 
coefficient means. Job Status for the boys was ahout equally well predicted 
by childhood IQ as by their completed educational levels; for the girls, job 
Status was more correlated with childhood IQ than with educational at- 
tainment. In another study, adult intelligente was also more highly corre¬ 
lated with occupational Status than with educational attainmenr (see 
Duncan 1968). But this may make a somewhat different point, inasmuch 
as adult intelligence may itself be affected by educational attainmenr, in 
contrast to rhe IQ one chalks up at age 7 or 8 years. In yer another study, 
based on Swedish data, adult income (as distinguished from occupational 
Status) was lessstrongly dependent on childhood IQ (age 10) than on even¬ 
tual educational artainment (T. Husen s data presenred in Griliches 1970), 
although being strongly dependent on both. Other analyses come up with 
different assessmentsof the underlying relarionships (e.g., Bowles and Gin- 
tis 1976; Jencks 1979). Not suq^risingly, the empirical picture, heing ex- 
tremely diverse and rieh, bas lent itself to myriad formal analyses, which 
we will make no artempt to review. In Chapters 3 and 4, we present our 
interpretarion of the link between individual ability and uecupation. We 
also discuss some of the evident exceptions to these findings. 

7. Many of the major studies (e.g., Duncan 1968; Jencks et ab 1972; McCali 
1977; Sewell and Hauser 1975) include variables describing familial st)- 
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cioeconomic Status, which prove to he somewhat predictive of u person’s 
own Status. 

8. For a fuller discussion of both the explanation and the controversy, see 
Herrnstein 1973. 

9. Teasdale, Sorenson, and Owen 1984- 

10. The authors of the study offered as an explanation for this pattern of re- 
sults the well-established pattem of resemhlances among relatives in IQ, 
presumably owing to the genes that natural siblings share and tbat adop- 
tive siblings do not share. It could, of course, be traits of personality rather 
than of intellect that tie a family’s occupational destinies together. How- 
ever, the small body of evidence bearing on personality traits finds them 
to be distinctly weaker predictors of job Status than is IQ. Another study, 
of over 1,000 pairs of Norwegian twins, supported the conclusion that the 
resemblance in job Status among dose relatives is large ly explained by rheir 
similarity in IQ and that genes play a significant role in this similarity. See 
Tambs et ab 1989. 

11. For some of the most detailed distributional data, see Stewart 1947, Table 1. 

12. Matarazzo 1972, p. 177. 

1 3. Specific cognitive strengths also vary by occupation, with engineers tend- 
ing to score higher on analytic and quantitative sections of the Graduate 
Record Exams, while English professors do betrer on the verbal portions 
(e.g., Wah and Robinson 1990, Figure 2.2). 

14. With a mean of 100 and SD of 15, an IQ score of 120 cuts off rhe 91 st per- 
centile of a normal distribution. But rhe IQ disrrihution tends to be skewed 
so that it is f'at on the right tail. To say that 120 cuts off the top renth is 
only approximate but close enough for our purposes. 

15. The procedure we used to create the figure on page 56 yielded an estimate 
of 23.2 percent of the top IQ decile in high-IQ occupations in 1990. Of 
the top IQ decile in the NLSY as of 1990, when they ranged in age from 
25 to 32, 22.2 percent of the top decile were employed in rhe dozen high- 
IQ occupations. The analysis excludes those who were still enrolled in 
school in 1990 and those who were in the military (because their occupa- 
tion within the military was unknown). The NLSY figure is an underesti- 
mate (compared to the national estimate) in that those who are still 
students will disproportionately enter high-IQ professions. On the other 
hand, the NLSY would be likely to exceed the national data in the figure 
insofar as the entire NLSY age cohort is of working age, without rerirees. 
One other comment on possible distortions over time: lt might be hy- 
pothesized that, since 1900, the mean has dropped and distribution has 
spread, as more and more people have entered those professions. The plau- 
sibiliry of rhe hypothesis is arguable; indeed, there are reasons for hypoth- 


esizing that the opposite has occurred (for the same reasons educational 
stratification has raised the IQ of students at the elite Colleges). But it 
would not materially affect the plot in rhe figure on page 56 even if true, 
because the numhers of people in those professions were so small in the 
early decades of the Century. It may also be noted that in the NLSY data, 
46 percent of all job slots in the high-IQ occupations were held by people 
in the top decile, again matchingourconjecture about the IQ scores within 
the occupations. 

16. Terman and Oden 1947. 

17. The NLSY cannot answer that question, because even a sample of 11,878 
(the number that took the AFQT) is too small to yield adequate sample 
sizes for analyzing subgroups in the top ternh of the top percentile. 

18. There are not that many people with IQs of 120+ left over, after rhe known 
concentrations of them in the high IQ occupations are taken into accounr. 

19. The literature is extensive. The studies used for this discussion, in addi- 
tion to those cited specifically, include Bendix 1949; Macmahon and Mil¬ 
ten 1939; Pierson 1969; Stanley, Mann, and Doig 1967; Sturdivant and 
Adler 1976; Vance 1966; Warner and Abegglen 1955. 

20. Newcomer 1955, Table 24, p. 68. 

21. Cdews 1908, pp. 27, 37, quoted in Newcomer 1955, p. 66. 

22. The data are drawn from Newcomer 1955. 

23. Burck 1976. The Fortune survey was designed to yield data comparable 
with those in Newcomer 1955. 

24- The ostensible decline in College degrees after 1950 is explained by Col¬ 
lege graduates’ going on to get additional educational credentials. For an- 
other study of educational attainment of CEOs that shows rhe same 
pattern, see Priest 1982. 

25. U.S. Bureau of the Census 1992, Tabtes 18, 615, and U.S. Department of 
Labor 1991, Table 22. 

26. Excluding accountants, who were already counted in the high-IQ profes¬ 
sions. 

27. Matarazzo 1972, Table 7.3, p. 178. 

Ühupter 3 

1. Bok 1985b. In another setting, again discussing the SAT, he wrote, “Such 
tests are only modestly correlated wirb subsequent academic success and 
give no reliable indication of achievement in later life” (Bok 1985a, p. 15). 

2. The correlation of IQ with income in a restricted population such as Har¬ 
vard graduates could be negative when people toward the top of the IQ 
distribution are disproportionately drawn into academia, where they make 
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a decent living but seldom much more than that, while students with lQs 
of u on[y” 120 and 130 will more often go inro rhe business worid, where 
they may get rieh. 

3. See Chapter 19; Dunnette 1976; Ghiselli 1973. 

4. Technically, -a correlation eoeff ident is a rat io, with the co Variation of the 
two variables in the numerator and the product of the separate Standard 
deviations of the two variables in the denominator. The formula for Com¬ 
puting a Pearson product moment correlation r (the kind that we will be 
using throughout) is: 


X(x-x)(y-y) 


I(x-x)-’][x(y- y )-' 


where X and Y refer to the actual values for each case and X and V refer to 
the me an values of the X and Y , respecrively. 

5. We limited the sample to families making less than $100,000, so as to avoid 
some distracting technical issues that arise when analyzing income across 
the entire Spectrum (e.g., the appropriateness of using logged values rarher 
rhan raw values). The results from the 1 pereent sample are in line with 
the statistics produced when the analysis is repeated for the entire national 
sample: a correlation of .31 and an increment of $2,700 per year of addi¬ 
tional education. Income data are for 1989, expressed in 1990 dollars. 

6. An important distinction: The underlying relationship persisrs in a sam¬ 
ple with restricted ränge, but the restriction of ränge makes the relation¬ 
ship harder to identify (i.e., the correlation coefficienr is attenuated, 
sometimes to near zero). 

Forgetting about restriction of ränge produces fallacious reasoning that 
is remarkably common, even among academics who are presumably famil¬ 
iär with the problem. For example, psychologist David McClelland, writ- 
ing at the height of the anti-lQ era in 1973, argued against any relationship 
between career success and IQ, pointing out that whereas College gradu- 
ates got hetter jobs than nongraduates, the academic records of graduates 
did not correlate with job success, even rhough College grades correlate 
with IQ. He added, anecdotally, that he recalled his own College dass— 
Wesleyan University, a top-rated small College—and was convinced that 
the eight best and eight worst students in his dass had not done much dif- 
ferently in their subsequent careers (McClelland 1973). This kind of ar- 
gument is also common in everyday life, as in the ad vice offered by friends 
during the course of writing this hook. There was, for example, our friend 
the nuclear physicist, who prefaced his remarks by saying, “I don’t think 
Fm any smarter than the average nuclear physicist . . Or an engineer 


friend, a key figure in the Apollo lunar landing program, who insisted that 
this IQ business is much overemphasized. He had been a C Student in Col¬ 
lege and would not have even graduated, except that he managed to pull 
himsdf together in his senior year. His condusion was that motivation was 
important, not IQ. Did he happen to know what his IQ was? Sure, he 
replied. It was 146. He was right, insofar as motivation can make the dif- 
ference between being a first-rate rocket scientisr and a mediocre one—if 
you Start with an IQ of 146. Hut the population with a score of 146 (or 
ahove) represents something less than 0.2 pereent of the population. Sirn- 
ilarly, corrdations of IQ and job success among College graduates suffer 
from restriction of ränge. The more selective the group is, the greater the 
restriction, which is why Derek Bok may plausibly (if not quite accuratdy) 
have claimed that SAT scores have “no correlation at all with what you 
do in the rest of your life” if he was ralking about Harvard students. 

7. E.g., Fallows 1985. 

8. See Chapter 20 for more detail. 

9. üriggs v. Dula> Power, 401 U.S. 424 (1971). 

10. The doctrine has heen huilt into the U.S. Employment and Training Ser¬ 
vices General Aptitude Test Battery (GATB), into the federal civil Ser¬ 
vices Professional and Administrative Career Examinarion (PACE), and 
into rhe military’s Armed Services Vocational Aptitude Battery (ASVAB). 
Bartholet 1982; Braun 1992; Gifford 1989; Kelman 1991; Seymour 1988. 
For a survey of test instruments and their use, see Friedman and Williams 
1982. 

11. For a reeent review of the expert Community as a whole, see Schmidt and 
Ones 1992. 

12. Hartigan and Wigdor 1989 and Schmidt and Hunter 1991 represent the 
two ends of the ränge of expert opinion. 

1 5. For a sampling of the new methods, see Bangert-Drowns 1986; Glass 1976; 
Glass, McGaw, and Smith 1981; Hunter and Schmidt 1990. Meta-analytic 
Strategien had heen tried for decades prior ro the 1970s, but it was alter the 
advent of powerful Computers and Statistical Software that many of rhe 
techniques became practicable. 

14- Hartigan and Wigdor 1989; Hunter and Schmidt 1990; Schmidt and 
Hunter 1981. 

15. We have used the terms job produeävity or job perfannance or performance 
ratings with out explaining what they me an or how they are measured. On 
the other hand, all of us have a sense of what job productivity is tike—we 
are confident that we know who are the hetter and worse secretaries, man- 
agers, and colleagues among those with whom we work closely. But how is 
this knowledge to be captured in objective measures? Ratings by Supervi¬ 
sors or peers? Samples of work in the various tasks that a job demands? Tests 
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of joh knowledge? Job tenure or promotion? Direct cost accounting of 
workers’ output? There is no way to answer such a quesrion decisively, for 
people may legitimately disagree about what it is abour a workers perfor¬ 
mance that is most worth predicting. As a practical matter, ratings hy Su¬ 
pervisors, being the most readily obtained and the least inrrusive in the 
workplace, have dominated the literature (Hunter 1986). Rut ir is natural 
to wonder whether Supervisor ratings, besides being easy ro ger, truly mea- 
sure how well workers perform rather than, say, how they get along wirb 
the boss or how they look (Guion 1983). 

To get a better fix on what the various measures of performance niean, 
it is useful to evaluate a number of studies that have included measures ot 
cognitive ability, Supervisor ratings, samples of work, and tests of joh knowl¬ 
edge. Work samples are usually obtained by setring up stations for workers 
to do the various tasks required by their jobs and having rheir work eval- 
uated in some reasonably objective way. Different oecupations lend thenv 
selves more or less plausibly to this kind of simulated performance. The 
same is true of written or oral tests of joh knowledge. 

One of the field’s leaders, John Hunter, bas examined the correlational 
structure that relates these different ways of looking at job performance to 
each other and to an intelligence test score (Hunter 1983, 1986). ln a study 
of 1,800 workers, Hunter found a strong direct link between intelligence 
and job knowledge and a mach smaller direct one between intelligence 
and performance in work sample tasks. By direct we mean that the vari¬ 
ables predict each other without taking any other variable into account. 
The small direct link between intelligence and work sample was aug- 
mented by a large indirect link, via job knowledge: a person’s intelligence 
predicted bis knowledge of the joh, and his knowledge in turn predicred 
his work sample. The correlation (after the usual Statistical corrections) 
between intelligence and job knowledge was .8; between intelligence and 
work sample it was .75. The indirect link befween intelligence and work 
sample, via joh knowledge, was larger hy half than the direct one (Hunter 
1986). 

The correlation between intelligence and Supervisor ratings in Hunters 
analysis was .47. Upon analysis, Hunter found that the primary reason is 
that brighter workers know more abour their jobs, and Supervisors respond 
fa vorab ly to their knowledge. A comparable analysis of approxi mately 
1,500 mllitary personnel in four specialties produced the same hasic find- 
ing (Hunter 1986). This may seem a weakness of the Supervisor rating mea- 
sure, but is it really? How much workers know about their jobs correlates, 
on the one hand, wirb their intelligence und, on the other, with both how 
they do on direct tests of their work and how they are rated by their Su¬ 


pervisors. A workers intelligence influences how much he learns about the 
joh, and joh knowledge eontrihutes to proficiency. The knowledge also in- 
fluences the impression the worker inakes on a Supervisor rating more than 
the work as measured by a work sample test (which, of course, the Super¬ 
visor may never see in the ordinary course of husiness). Using Supervisor 
rating as a measure of proficiency is thereby justified, without having to 
claim that the rating directly measures proficiency, 

Hunter found that work samples are more dependenr on intelligence 
and joh knowledge than are Supervisor ratings. Supervisor ratings, which 
are so predominant in this literature, may, in other words, underesrimate 
how important intelligence isfor proficiency. Recent researchsuggests that 
Supervisor ratings in facr do underestimate the correlation between intel¬ 
ligence and productivity (Becker and Huselid 1992). Rut we should ac- 
knowledge again that none of the measures of proficiency—work samples, 
Supervisor ratings, or joh knowledge tests—is free of the taint of artifieial- 
ity, 1er alone arbitrariness. Supervisor ratings may be biased in many ways; 
a test of joh knowledge is a test, not a job; and even a worker going from 
one work Station to another under the watchful eye of an industrial psy¬ 
chologist may he revealing somerhing other than everyday competence. It 
has been suggested that the various contrived measures of workers teil us 
more about maximum performance than they do about typical, day-to-day 
proficiency (Guion 1983). We therefore advise that the quantitative esti- 
mates we present here (or that can he found in the technical literature at 
large) he considered only tentative and suggestive. 

16. The average validity of .4 is obtained after Standard Statistical corrections 
of various sorrs. The rwo most important of these are a correction for test 
unreliability or measurement error and a correction for restriction of ränge 
among the workers in any occupation. All of the validities in this section 
of the chaprer are similarly corrected, unless otherwise noted. 

17. Ghiselli 1966, 1973; Hunterand Hunter 1984, Table 1. 

18. Hunter 1980; Hunter and Hunter 1984. 

] 9. Where availahle, ratings hy peers, tests of job knowledge, and actual work 
samples often come close to ability measures as predictors of joh perfor¬ 
mance (Hunter and Hunter 1984). But aptitude tests have the practical 
ad van tage that they can be admi niste red relative ly inexpensively to large 
numbers of applicants, and they do not depend on applicants’ having been 
on the joh for any length of time. 

20. E. F. Wonderlic & Associates 1983; Hunter 1989. These valid ities, which 
are even higher than the ones presenred in the table on page 74 are for 
training success rather than for measures of job performance and are more 
directly comparable with the column for training success in the GATR 
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Studios than the colutnn for job proficiency. Regarding joh performance, 
onc major study evaluatod the performance of ahout 1,500 air force erv 
listed men and women working in eight milirary specialties, chosen to be 
representative of military specialties in the air force. Performance was van 
iously measurecl: by defining a set of tasks involved in each job, then train- 
ing a group of evaluators to assess rhose specific tasks; by interviews of the 
personnel on technical aspecrs of their jobs; by Supervisor ratings afrer 
train ing the Supervisors; and combinations of methods. The average cor' 
relation between AFQT score and a hands-on job performance mcasure 
was .40, with the highesr among rhe precision measurement equipment 
specialists and the avionics Communications specialists and the lowest 
among the air traffic control operators and the air crew life support spe¬ 
cial ists. Insofa r as rhe jobs were restrieted to those he Id by enlisted men, 
the distribution of jobs was somewhat skewed toward the lower end of the 
skiil ränge. We do not have an availahle estimate of rhe validity of rhe 
AFQT over all military jobs. 

21. Hartigan and Wigdor 1989. 

22. It is one of the chronically frustrating experienees when reading scienrific 
results: Two sets of experts, supposedly usmg comparable data, come out 
with markedly different condusions, and the reasons for the differences are 
buried in technical and opaque language. Hovv is it possible for a layper- 
son to decide who Ls right? The different estimates of mean validity of the 
GATB—.4^ according to Hunter, Schmidt, and some others; .25 accord- 
ing to the Hartigan cornmittee—is an instructive case in point. 

Sometimes the differences real ly are technical and opaque. For exam- 
ple, the Hartigan cornmittee based its estimate on the assumprion t har the 
reliability of Supervisor ratings was higher than other studies assumed—.8 
instead of .6 (Hartigan and Wigdor 1989, p. 170). By assuming a higher re- 
liability, rhe cornmittee s correction for measurement error was smaller 
than Hunter’s. Deciding between the Hartigan commitree’s use of .8 as the 
reliability of Supervisor ratings instead of the .6 used by Hunter is impossi- 
ble for anyone who is not intimately familiär with a large and scattered lit- 
erature on that topic, and even then the choice remains a matter of 
judgment. But the Hartigan committee’s decision not to correct for re- 
striction of ränge, which makes the largest difference in their estimates of 
the overall validity, is based on a much different kind of disagreement. 
Here, a layperson is as qualified to decide as an expert, for this is a dis- 
agreement about whar question is being answered. 

John Hunter and others assumed that for any job the applicant pool is 
the entire U.S. work force. That is, they sought an answer to the question, 
“What is the re lat ionship between job performance and intelligence for 
the work force at large?” The Hartigan cornmittee ohjected to their as- 


sumption on grounds that, in practice, the applicant pool for any particu- 
lar job is not the entire U.S. work force but people who have a chance to 
get rhe job. As they accurateiy noted, “People gravi täte to jobs for which 
they are potential ly suited” (Hartigan and Wigdor 1989, p. 166). 

But embedded in the committee’s ohjecrion to Hunter’s estimates is a 
tacit switch in the question that the analysis is supposed to answer. The 
Hartigan cornmittee sought an answer to the question, “Among those peo- 
ple who apply for such-and-such a position, what is rhe re lat ionship be- 
tween intelligence and job performance?” lf one’s objective is not to 
discourage people who weigh only 250 pounds from apply ing for jobs as 
tackles in the NFL, to return to our analogy, then the Hartigan commir- 
tee’s question is the appropriate one. Of course, by minimizing the valid¬ 
ity of weighr, a large number of 1 50-pound lineman may apply for the jobs. 
Thus our reasons for c onc lud ing that the assumption used by Hunter and 
Schmidt (among others), that restriction of ränge calculations should be 
based on the entire work force, is self-ev ident ly the appropriate choice if 
one wants to know the overall relarionship of IQ to job performance and 
its economic consequences. 

2T The ASVAB comprises ten subtesls: General Science, Arithmetic Rea- 
son ing, Word Knowledge, Paragraph Comprehension, Numerical Opera - 
tions, Coding Speed, Auto/Shop Information, Mathematics Knowledge, 
Meehanical Comprehension, and Electronics Information. Only Numer- 
ical Operations and Coding Speed are highly speeded; the other eight are 
nonspeeded “power” tesrs. All the armed Services use the four MAGE com- 
posites, for Meehanical, Administrative, General, and Electronics spe¬ 
cial ties, each of which includes rhree or four suhrests in a particular 
weighting. These composites are supposed to predict a recruit’s trainabil- 
ity for the particular specialty. The AFQT is yer another composite from 
the ASVAB, selected so as to measure g efficiently. See Appendix T 

24. Ahout 80 percent of the sample had graduated from high school and had 
no further civil ian school ing, fewer than 1 percent had failed to graduate 
from high school, and fewer than 2 percent had graduated from College; 
the remainder had some post-high school civilian schotding short of a Col¬ 
lege degree. The modal person in the sample was a white male hetween 19 
and 20 years old, hut the sample also included thousands of women and 
people from all American ethnic groups; their ages ranged from a mini- 
mum of 17 to almost 15 percent above 23 years (see Ree and Earles 1990h). 
Other studies, using educationally heterogeneous samples, have in fact 
shown that, holding AFQT constant, high school graduates are more likely 
to avoid disciplinary action, ro be recommended for reenlistment, and to 
he promoted to higher rank than nongraduates (Office of the Assistant 
Secretary of Defense 1980). Current enlistment policies reflect the inde- 
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pendent predictiveness of education, in that of two applicants with equal 
AFQT score, the high school graduate is selected over the nongraduate if 
only one is to he accepted. 

25. In fact, there may be some upward bias in these correlations, inasmuch as 
they were not cross validated to exclude Capitalization on chance. 

26. What does it mean to “account for the observed Variation 7 Think of it. in 
this way: A group of recruits finishes its training course; their grades vary. 
How much less would they have varied had they entered the course wirb 
the same level of g?This may seem like a hypothetical question, hur it is 
answered simply by squaring the correlation hetween the recruits’ level of 
g and their final grades. In general, given any two variables, the degree to 
which Variation ineither isexplained (oraccounted for, in Statistical lingo) 
by the other variable is obtained by squaring the correlation between them. 
For example, a perfect correlation of 1 between two variables means that 
each of the variables fully explains the observed variations in the other. 
When two variables are perfectly correlated, they are also perfeetly re¬ 
dundant since if we know the value of one of them, we also know the value 
of the other without having to measure it. Hence, 1 squared is 1.0 or 100 
percent. A correlation of .5 means that each variable explains, or accounts 
for, 25 percent of the observed Variation in the other; a correlation of 0 
means that neither variable accounts for any of the observed Variation in 
the other. 

In the Ree and Earles study, over all eighty-nine occupational schools, 
the average value of this square correlation was 58 percent (which corre- 
sponds to a correlation of .76). g, in other words, accounted for almost 6Ö 
percent of the observed Variation in school grades in the average military 
course, once the results were corrected for ränge restriction. Even without 
a correction for ränge restriction, g accounted for over 20 percent of the 
variance in school grades on the average (corresponding to a correlarion 
of .45). 

27. Welsh, Watson, and Ree 1990. 

28. Jones 1988. A similar analysis was performed for job performance but, be- 
cause of the expense of obtaining special periormance measures, with a 
much smaller sample (1,545) spread across just eight enlisred job special- 
ties (Ree and Earles 1991). The correlations with g in this study did not 
reach the extraordinarily high levels of predictiveness as for school grades, 
and the other cognitive factors were relatively more important for job per¬ 
formance than for school grades—points to which we shall retum. But 
combining the results with the previously cited job performance study of 
air force personnel (Office of the Assistant Secretary of Defense for Force 
Management and Personnel 1989), the job predictiveness of AFQT for the 
specialties is correlated above .9 with the job predictiveness of g. Using the 


highest of the various correlations between joh performance measures and 
g, rhe product-moment correlation is .97 and the Spearman rank-order cor¬ 
relation is .93. ln other words, in predicting joh performance, at least for 
these jobs and these performance tests, rhe validity of an AFQT score is 
virrually enrirely explained by how wel l it measures g, perse. 

29. Thorndike 1986. The comparison is between the predictiveness of the first 
factor exrracted by factor analysis of the f ive cogn iti ve suhtests of G ATB ver¬ 
sus the regression-weighted subtest scores themselves, for cross-validating 
samples of ar least fifty workers in each of the twenty-eight occupations. 

30. Hawk 1986;jensen 1980, 1986; Linn 1986. 

31. For the linear relationship of cognitive ahility, sce Schmidt, Ones, and 
Hunter 1992. For the nonlinear relationship of job experiences see 
Blankenship and Taylor 1938; Ghiselli and Brown 1947; Taylor and Smith 
1956. 

32. Hawk 1970; Hunter and Schmidt 1982. 

33. Humphrcys 1968, 1973; Wilson 1983. 

34- See p. 66. 

35. Butler and McCauley 1987. 

36. McDaniel, Schmidt, and Hunter 1986. 

37- Schmidt et al. 1988. 

38. Maier and Hiatt 1985. 

39. This story echoes the mixed findings for the learning of simple tasks in the 
psychological laborarory. Depending on which measures are used ro pre- 
dict performance and which tasks are he ing predicred, one can expect ei- 
ther to see convergence of performance with practice, or no convergence, 
or even divergence under some cireumstances. See Ackermann 1987. 

40. Schmidt et al. 1988. No data have yet tested rhe possibility that produc- 
tiviry diverges (the advantage enjoyed by rhe smarter employee increases 
with experience) in very-high-complexiry jobs. 

41- See also Schmidt et al. 1984- 

42. See rhe cliscussion in note 1 5. 

43. Burke and Frederick 1984; Hunter and Schmidt 1982; Hunter, Schmidt, 
andjudiesch 1990; Schmidtand Hunter 1983; Weekleyetal. 1985. ln the 
Technical literature, the Standard deviation of productivity measured in 
dollars is represented as SD^ and has generally been estimared to average, 
over many different occupations, .4 times the average wage for the joh. The 
correspond ing figure as a proportion of the value of the average worker out¬ 
put is .2. Methods for estimating these distrihutions are discussed in the 
cited references, but they include such techniques as Supervisor vatings of 
the dollar costs of replacing workers at various points in the distribution 
of workers, cost accounting of worker product, and scores on proficiency 
tests and at work sample stations. 
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44. Becker and Huselid 1992. 

45. The more Contemporary esrimate would place this value at ahout $ 16,000 
rather than $8,000. All the other dollar estimates of the benefits of test- 
ing mentioned in this secrion could similarly he douhled. 

46. Hunter, Schmidt, and Judiesch 1990. 

47. Wo use rounds numhers to make the calculations easy to follow, hur these 
are in fact close to the current medians. 

48. Hunter, Schmidt, and Judiesch 1990. 

49. 25,000x .15 = 3,750; 100,000x .5 = 50,000; 50,000/5,750 = 13.33. 

50. 100,000 x.5x.6 = 30,000; 25,000 x .15 x .2 = 750. 

51. There is another point illustrated by this exercise. Recall that a validity 
(correlation) “explains” only the amount of variance equal to its square; 
hence a validity of .4 explains only 16 percent of the variance, and this of- 
fers a temptarion to dismiss the importance of Lntelligence as heing of neg- 
ligible economic consequences. And yet when we calculated the gains to 
he realized from an ability test that is less than perfeerly valid as a predic- 
tor of proficiency, we multiplied the gain from a perfect rest hy the valid¬ 
ity, not by the square of the validity. When trying to estimate how mach 
of the value of a perfect selection procedure is captured by an imperfect 
suhstitute, the validity of the imperfect test is equal to the proportion of 
the value that is captured by it. A test with a validity of .4 captures 40 per¬ 
cent of the value thar would he realized from a perfect test, even though it 
explains only 16 percent of the variance. Readers interested in the math- 
ematical proof, which was first derived in the 1940s, will find it in Hunter 
and Schmidt 1982. 

52. Two of the classic discussions of the conditions ander which test ing pays 
oft are Brogden 1949 and Cronbach and Gleser 1965. 

5 h These correlations cover the empirical ränge in two senses. First, rhey 
hracket the values found in the technical literature deal ing with the pre- 
dicriveness of intelligence. Second, they hracket the various occupations, 
as deserihed hy Hunter, Schmidt, and rheir colleagues. More complex jobs 
have higher correlations hetween intelligence and proficiency, hat almost 
all common occupations fall in the ränge hetween .2 and .6, The graphs 
assume normaliry of the predictor and outcome variables and a linear re- 
lation hetween them. None of these assumptions needs to he srrtetly met 
in order for the figure to give at least an approximately correct account of 
the relationships, nor are there any known deviations from normal iry or 
linearity thar would material ly alrer the account. 

54- We estimate the percentile values hy assuming that proficiencies are nor¬ 
mal ly distributed. 

55. Hunter and Hunter 1984; Schmidt, Mack, and Hunter 1984. 

56. Harrigan and Wigdor 1989; Hunter and Hunrer 1984- 


57. The data for the follow ing description come from Herrnstein, Belke, and 
Taylor 1990. 

58. Hunter 1979. 

59. Murphy 1986. 

Charter 4 

1. Juhn, Murphy, and Fierce 1990; Kurz and Murphy 1900. 

2. Twenty-three percent for sixteen or more years of education versus 11 per¬ 
cent tor tweive or fewer years, according to Katz and Murphy 1990. 

h Freeman 1976. 

4. The wagedecline in the 1970s for highly educated workersand in the 1980s 
for less educated workers could conceivably have heen due ro declines in 
rhe quality of College education in the earlier period and in primary and 
high school education in the later period or in corresponding changes in 
the skills of people at those levels of education, as reflected, for example, 
in the decline of SAT scores (Bishop 1989). Economists assessing this hy- 
pothesis have concluded that ir could not have piayed a major role (see 
Black hum, Bloom, and Freeman 1990; Juhn, Murphy, and Fierce 1990; 
Kan and Murphy 1990). 

5. The dramatic growth of female work force participation would necessitate 
complex modeling to address for the Lahor force as a whole the question 
here dealt with just for men. 

6. Comparing men with sixteen or more years in school to those with fewer 
than tweive years gives a 26.8 percent differential and to those with tweive 
years in school gives 29.8. Since each eategory is being compared to itsown 
baseline, this calculation understates rhe size of the change in actual real 
wages. 

7- In a slightly different approach to the data, Kevin Murphy and Finis Welch, 
restricting the analysis to white workers, also found that more education 
had a shrinking wage henefit from 1963 to 1979, followed hy a steeply ris- 
ing benefit, hut only for new workers. For experienced workers, the wage 
benefit for education did not decline during the earlier period, then rose 
more modestly thereafter. Work experience, in other words, dampened the 
wage benefit for education from the 1970s to the 1980s (Murphy and 
Welch 1989. See also Murphy and Welch 1993a, 1993b). 

8. Thar intelligence is confounded with educational atrainment is hardly a 
new idea. See Arrow 1973; Herrnstein 1973; Jencks et ab 1972; Sewell and 
Hauser 1975. 

9. Juhn, Murphy, and Fierce 1990; Katz and Murphy 1990. 

10. Fublic employment shielded workers, especially female workers, from the 
rising wage premium for education in the 1980s and the rising premium 
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for unmeasured individual characteristics, presumably including intelli¬ 
gence. In the upper half of the wage distrihurion for high ly educated work- 
ers, the ratio of federal to private wages declined from 1979 to 1988, even 
after corrections for race, age, and region of the country (Cutler and Katz 
1991). The decline was especially large for women, perhaps hecause edu¬ 
cated woinen were finding relatively more lucrative alternatives outside 
the govemment. For less educated workers in the lower halt of the wage 
distribution, the ratio of federal to private wages rose during that intecval, 
again especially for women. For state and local (as distinguished from fed¬ 
eral) public employees, the rise in the ratio of public to private wages for 
less educated workers was larger still. 

11. “Residual” in the regression analysis sense. After accounting for the effects 
of education, experience, gender, and their various interactions, a certain 
amount of real wage variance remains unexplained. This is the residual 
that has been growing. 

12. Juhn, Murphy, and Pierce, 1990; Katz and Murphy 1990; Levy and Mur- 
nane 1992. 

13. Juhn, Murphy, and Pierce 1990, 

14- Diligence, or conscientiousness, is one noncognitive trait that appears to 
eam a wage premium (Schmidt and Ones 1992). Drive, ambition, and so- 
ciability have heen examined by Filer (1981). None of these has been as 
well established as cognitive ability, nor do they appear to be as significanr 
in their economic effects. 

15. Blackburn and Neumark 1991. 

16. Blackburn and Neumark 1991. This study used the National Longitudinal 
Survey of Youth (NLSY), a database described in the introduction to Part 
II. 

17. Lest we convey the false impression that we are suggesting that education 
per se is irnmaterial, once intelligence is taken account of, we note two in- 
genious studies by economists Joshua Angrist and Alan Krueger (Angrist 
and Krueger 1991a, 1991b). They examined wages in relation to school- 
ing for school dropouts born at different times of year and for people with 
varying draft lottery numbers. Dropouts in many States must remain in 
school until the end of the academic year in which they reach a given age. 
For people who want to drop out as soon as possible, those bom in, say, Oc- 
roher will spend a year in school more than those born in January. Like- 
wise, during the Vietnam era, people whose only reason for staying in 
school was to avoid the draft would get more schooling if they had low lot- 
tery numbers, making them more likely to be drafted, than if they had high 
numbers. In both populations, the extra schooling showed a wage benefit 
later on. These findtngs show effects of education above and beyond per¬ 
sonal traits like intelligence, if we assume that intelligence is uncorrelated 


with the month in which one is born or the lottery number. ln fact, hu¬ 
man births are moderately seasonal, and the seasonality differs across races, 
ethnic groups, and socioeconomic Status, which may mean that births are 
seasonal with respect to average intelligence (Lam and Miron 1991). No 
such complication confounds the study using lottery numbers. Even so, the 
generality of these findings for populations other than school dropouts and 
for people who stayed in school only to avoid being drafted remains to be 
estahlished. 

18. Again from the NLSY. The sample chosen for this particular analysis was 
at least 30 years old, had been out of school for at least a year, and had 
worked fifty-two weeks in 1989 (from Top Decile Analysis). The median 
(as distinguished from the mean) difference in annual wages and salaries 
was much smaller: $3,000. A bulge of very-high-income individuals in 
these uccupations among those with high lQs explains the gap between 
the mean and the median. For example, in these occupations, among those 
in the top decile of IQ, the 97.5th percentile of annual income was over 
$180,000; for those not in the top IQ decile, the corresponding income 
was $62,186. 

19. The median wage for each occupation is the wage that has as many wages 
above ir as below it in the distribution of wages in the occupation. A me¬ 
dian expresses an average that is relatively insensitive to extreme values at 
eirher end. 

20. A high IQ is also worth extra income outside the high-IQ occupations as 
we defined them. The wages and salaries of people not in the high-IQ oc¬ 
cupations but with an IQ in the top 10 percent earned over $11,000 more 
in 1989 (again in 1990 dollars) rhan those with IQs helow the top decile. 
The median family income of those in the top IQ decile who did not en¬ 
ter the high-IQ professions was $49,000, putting them at the 72d percentile 
of family ineomes. 

21. Solon 1992; Zimmerman 1992. Women are not usually includec! in these 
studies hecause of the analyticcomplications arising in the recent dramatic 
changes in their work force participation. The correlation is even higher 
if the predictor of the son’s income is the family income rather than just 
the fathers (Solon 1992). These estimates of the correlation between fa¬ 
che r and sun income represent a new finding. Until recent ly, special ists 
mostly agreed that income was not a strong family trait, certainly not like 
the family chin or the baldness that passes on from generation to genera- 
tion, and not even as enduring as the family nest egg. They had concluded 
that the correlation between fathers and sons in income was between .1 
and .2—very low. Expert opinion has, however, heen changing. The older 
estimates of the correlation between fathers’ and sons’ ineomes, it turns 
out, were plagued by two familiär problems that artificial ly depress corre- 
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lation coefticients. First, rhe populations used for gathering the estimates 
were unrepresentative. One large study, for example, used only high school 
graduates, which no douht restricted the ränge of IQ scores (Sewell and 
Hauser 1975). Another prohlem has been measurement error—in the case 
of intergenerational comparisons of ineome, measurement error intro- 
duced by basing the analysis on a single year’s ineome. Averaging ineome 
over a few years reduces this source of error. Now, using the national ly rep¬ 
räsentative, longitudinal data in the National Longitudinal Survey (NLS) 
and the Panel Study of Ineome Dynamics (PSID), economists have found 
the correlations of .4 to .5 reported in the text. 

22. Solon 1992. For comparable estimates for Great Britain, see Arkinson, 
Maynard, and Trinder 1983. 

23. U.S. Bureau of the Census 1991b, Table 32. 

24. Herrnstein 197.3, pp. 197-198. 

25. For reviews of the literature as of 1980, see Bouchard 1981; Plomin and 
De Fries 1980. For more recenr analyses, on which wo base the upper hound 
estimate of 80 percent, see Bouehard et al. 1990; Pedersen er ai. 1992. 

26. Plornin and Loehlin 1989. 

27. The proper Statistical measure of Variation is the Standard deviation 
squared, which is called the variance. 

28. Heritabiliry is a concept in quantitative genetics; tor a good textbook, see 
Falconer 1989. 

29. Social scientists will recognize the heritabiliry question as being akin to 
the general Statistical model of variance analysis. 

30. Plomin and Loehlin 1989. 

31. Bouchard er al. 1990. 

32. Estimating heritahilities from any relationship other than for identical 
twins is inherenrly more uncertain because the modeling is more comp lex, 
involving the esrimation of additional sources of genetic Variation, such as 
assortative mating (about which more helow) and genetic dominante and 
epistasis. See Falconer 1989. 

3 3. For a broad survey of all kinds of data published before 1981, sec into sev- 
eral Statistical models, the best fitting of which gave .51 as the estimate of 
IQ heritability, see Chipuer, Rovine, and Plomin 1990. Most of the data 
are from Western countries, but a recent analysis of Japanese data, hased 
on a comparison of identical and fratemal twin correlations in IQ, yields 
a heritability estimate of .58 (Lynn and Hatrori 1990). 

34- The extraordinary discrepancy between whar the experts say in their rech- 
nical publications on this suhject and what the media say the experts say 
is well described in Snyderman and Rothman 1988. 

35. Cyphers et al. 1989; Pedersen et al. 1992. 

36. Cyphers et al. 1989; Pedersen et ab 1992. 


37. Based primarily on a large study of Swedish identical and fraternal twins 
tollowed into late adulrhood (Pedersen et al. 1992). 

38. Plomin und Bergeman 1987; Rowe and Plomin 1981. 

39. IQ is not the only trait with a biological component thar varies across so- 
cioeconomic strata. Height, head size, blood type, age at menarche, sus- 
ceptihility to various congenital diseases, and so on are some of the other 
rrairs for which there is evidence of social dass differences even in racially 
homogeneous societies (for review, see Mascie-Taylor 1990). 

40. The Standard deviation squared times the heritability gives variance due 

just to genes; rhe square root of that numher is rhe Standard deviation of 
IQ in a world of perfectly uniform environmenrs: 15^ x .6) = 11.6 A 

heritahility of .4 would reduce the Standard deviation from the normative 
value of 15 to 9,5; with a heritability of .8., ir would he reduced to 1 3.4. 

4L lf we take the heritability of IQ to be .6, then the swing in IQ is 24 points 
for rwo children with identical genes, hur growing up in circumsrances thar 
are at, say, the lOth and the 90th cenrile in their capacity to foster intel- 
ligence, a very large swing indeed. A less extreme swing from the 40th to 
the 60rh centile in environmental condirions would niove the average IQ 
only 4-75 poinrs. ln a normal distribution, the distance from the lOth to 
rhe 90th percenrile is about 2.5 Standard deviation unirs; from the 40th co 
rhe 60th percenrile, it is about .5 Standard deviation units. lf the heri¬ 
tability is .8, insread of .6, then the swing from rhe lOth tu the 90th per- 
centile would he worth 17 IQ points, from the 40th to the 60th, 3.4 IQ 
points. 

42. Burgess and Wall in 1943, 

4.3. Spuhler 1968. 

44. Jensen 1978. This esrimate may be high for a variety of Technical reasons 
rhar are still being explored, but apparently not a lot too high. For more, 
see DeFries et al. 1979; Mascie-Taylor 1989; Mascie-Taylor and Vanden- 
berg 1988; Price and Vandenberg 1980; Watkins and Meredith 1981. ln 
the 1980s, some researchers argued that data from Hawaii indicated a 
fa Hing ievel of assortative mating for IQ, which they attributed to increased 
social mobility and greater access to higher education (Ahern, Johnson, 
and Cole 1983; Johnson, Ahern, and Cole 1980; Johnson, Nagoshi, and 
Ahern 1987). But the evidence seems to be limited to Hawaii. Other re¬ 
cent data from Norway and Virginia, not to mention the national census 
data developed by Mare and discussed in the text, fail to confirm rhe 
Hawaii data (Heath et ab 1985, 1987). When Intelligence and educational 
level are statistically pulled apart, the assortative mating for education, net 
of intelligente, is stronger than that for intelligente, net of educational 
level (Neale and McArdle 1990; Phillips et ab 1988). 

45. For a discussion of regression to the mean, see Chapter 1 5. The calcula- 
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rion in rhe texr assumes a corrclation of +.8 herween rhe average childs 
IQ and rhe midpoint of the parental [Qs, consistent wich a heritahility of 
.6 and a family environment eff ec t of .2. The estimate of average IQs in 
1930 is explained in Chapter 1. The estimate for the dass of 1964 (who 
were freshmen in 1960) is hased on Harvard SAT-Verbal scores compared 
to the Educational Testing Services national norm study conducted in 
1960, which indicates that the mean verhal score lor entering Harvard 
freshmen was 2.9 SDs ahove the mean of all high school seniors—and, hy 
implication, considerahly higher rhan that for the entire 18-year old co- 
hort (which includes the high school dropouts; Seibel 1962, Bender 1960). 
lf we estimate the corrclation hetween the SAT'Verhal and IQ as +.65 
(from Donlon 1984), the estimated mean IQ of Harvard freshmen as of 
1960 was about 130, from which the estimate of children’s IQ has heen cah 
culated. 

46. With a parent-child corrclation of .8, 64 percent ot the variance is ac- 
counted for, 36 percent not accounted for. The square root of .36, which 
is .6, rimes 1 5, is the Standard deviation of rhe distrihurion of IQ scores of 
the children of these parents. This gives a value of 9, from which the per- 
centages in the text are estimated. 

47. Operational ly, Mare compared marriage among people with sixteen or 
more years of schooling with those who had fevver than sixteen years of 
schooling (Mare 1991, p. 23). For additional evidence of increasing edu- 
eational hoinogamy in the 1970s and 1980s, see Qian and Preston 1993. 

48. Oppenheimer, 1988. 

49. DES 1992, Tables 160, 168. 

50. Buss 1987. For evidence that this phenomenon is well underway, see Qian 
and Preston 1993. 

51. In rhe NLSY, whose members graduated from high school in the period 
1976-1983, 59.3 percent had obtained a bachelor’s or higher degree hy 
1990. ln the “High School and Beyond” study conducted hy the Depart¬ 
ment of Education, only 44 percent of 1980 high school graduates who 
were in the top quartile of ability had obtained a B.A. or B.S. hy 1986 (Ea¬ 
gle 1988a, Table 3). 

52. See Chapter 1. 

53. Authors’ analysis of rhe NLSY. 

54. Authors’ analysis of the NLSY. 

55. SAUS 1991, Table 17. 

hitroduction to Part 11 

1. Sussman and Steinmetz 1987. This is still a valuable source of information 
about myriad aspects of family life, mainly in America. 


2. For example, in the last ten years, out of hundreds of articles and research 
notes, the preeminent economics joumal, American Economic Review, 
has published just a handful of articles that call upon IQ as a way of un~ 
derstanding such problems. The most conspicuous exceptions are Bishop 
1989; Boissiere et ah 1985; Levin 1989; Silberberg 1985; Smith 1984- 

3. The crirerion for eligibility was that they he ages 14 to 21 on January 1, 
1979, which meant that some of them had turned 22 hy the time the first 
interview occurred. 

4- Details of the Department of Defense enlistment tesrs, the ASVAB, are 
also given in Appendix 3. 

5. The test hattery was administered to small groups hy trained tost person- 
nel. Thar each NLSY suhject was paid $50 to take the test helped ensure 
a positive attitude toward the experience. 

6. See Appendix 3 for more on the test and itsg loading, and the Introduc- 
tion for a discussion of g itself. 

7. Raw AFQT scores in the NLSY sample rose with age throughout the age 
cohorts who were still in rheir reens when they rook the test. The sim- 
plest explanation is that the AFQT was designed by the military for a pop¬ 
ulär ion of recruits who would he taking the test in their late reens, and 
younger youths in the NLSY sample got lower scores for the same reason 
that high school freshmen get lower SAT scores than high school seniors. 
Ho wovor, a cohort effect could also be at work, whereby (hecause of ed- 
ucational or broad environmental reasons) youths born in the first half of 
rhe 1960s had lower realized cognitive ability than youths born in the last 
half of the 1950s. There is no empirical way of telling which reason ex- 
plains the age-related differences in the AFQT or what the mix of rea¬ 
sons might he. This uncertainty is readily handled in the multivariate 
analyses by entering the subject’s birthdate as an independent variable (all 
the NLSY sample took the AFQT within a few months of each other in 
late 1980). When we present descriptive statistics, we use age-equated 
cenriles. 

8. We assigned the NLSY youths to a cognitive dass on the basis of their age- 
equated centile scores. We use the dass divisions as a way to communicare 
the data in an easily understood form. It should be rememhered, however, 
that all of the Statistical analyses are hased on the actual test scores of each 
individual in rhe NLSY. 

9. Regression analysis is only remotely related to the regression to the mean 
referred to earlier. See Appendix 1. 

10. Age, too, is always part of the analytic package, a necessity given the na- 
ture of the NLSY sample (see note 7). 

11. The white sample for the NLSY was chosen by first selecting all who were 
caregorized by the interview screener as nonhlack and non-Hispanic. From 
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this gioup, we excluded all youths who idencified their own ethnicity as 
Asian, Pacific, American Indian, African, ov Hispanic. 

Chapcer 5 

1. Ross et al. 1987. The aurhors used the sample tapes for the 1940 and 1950 
censns ro calculate the figures for 1939 and 1949, antedating the begin- 
ning of the annual poverty statistics in 1959. The numbers represent roral 
money income, irvcluding govemment transfers. The figure for 1939 is ex- 
trapolated, since the 1939 census did not include dara on income orher 
than eamings. 1t assumes that the ratio of poverty hascd on eamings to 
poverty hased on total income in 1949 (.761) also applied in 1939, when 
68.1 percent of the population had eamings that put thetn he low the 
poverty line. Since govemment transfers increased somewhat in rhe in- 
terveningdecade, the resulting figure for 1939 should he considered a lower 
hound. 

Ir may he asked if the high poverty pereentage in 1939 was an artifact 
of the Great Depression. The numbers are inexact, but the answer is no. 
The poverty rate prior to the Depression—defined by the Contemporary 
poverty line—was higher yet. (See Murray 1988b, pp. 72-73). 

2. See the introduction to Part II for more on rhe distinction between inde¬ 
pendent and dependent variables. 

T Jensen 1980, p. 281. 

4. The observed stability of tests for children up to 10 years of age is reason- 
ahly well approximated hy the formula, 


vvhere r u and r,, are the reliahilities of the tests on occasions 1 and 2, 
C.A, and CA, are the subject’s chronological age on occasions 1 and 2, and 
r,, is rhe correlation between a tesr taken and retaken at ages CA, and CA,. 
See Bloom 1964 for a full discussion. 

5. After age 10, the correlation of tost scores will usually fall between the 
product of rhe reliahilities of the two tests and the square root of their prod- 
uct. Thus, for example, the correlation of two measures of IQ after age 10 
when hoth tests had reliahilities of .9 may he expected to fall between .81 
and .9. Since rhe best IQ tests have reliahilities in excess of .9, this is tan- 
tamount to saying that the stability of scores is quire high. Following are 
some sample reliahilities as reported in the puhlisher’s cest manuals. W1SC 
= .95, WAIS = .97, Wonderlic Personnel Test = .95. The reliahilities of 


some of the major Standardized achievement tests are also extremely high. 
For example: ACT - .95, SAT = 90+, California Achievement Tests = 
.90-.95, Iowa Test of Basic Skills Composite = .98-,99. For a longer list of 
reliahilities and an accessihle discussion of hoth reliahility and st.ahility, see 
Jensen 1980, Chap. 7. 

6. Is the re reason ro rhink thar, had the test been ad ministe red earlier, at age 
7 or 8, the results would have turned out diff erent ly ! The answer, with some 
reservations, is no. We would ohserve the normal level of flucruation in 
tests administered at ages 7 and 20, wirh some individuals scoring higher 
and some lower as they grow up. The correlarions between a person’s IQ 
ohrained at age 7 and social behavior in adulthood would support the same 
qualitative conclusions as those hased on an IQ ohrained at age 20. The 
correlations using the younger scores would be small er, beeause they mea- 
sure the adult trait of inteiligence less reliahly than a score ohrained later 
in lile. See Appendix 3 for a discussion of changes in IQ among the mem- 
hers of the NLSY sample. 

7. Himmeltarb 1984. 

8. F.g., Ryan 1971. 

9. For a tew words about regression analysis, see the Introduction to Part 11 
and Appendix 1. ln fewer words still, this is a method for assessing the in¬ 
dependent impact of each of a set of independent variables on a depen¬ 
dent variable. The specific form uscd here is called logistic regression 
analysis, the appropriate method for binary dependent variables, such as 
yes-no or female-male or married-unmarried, 

10. We eliminate srudents to avoid misleading ourselves with, for example, 
rhird-year law students who have low incomes in 1989 hut are soon ro be 
making high incomes. 

11. Note a distinction: Age has an important independent effect on income 
(income rrajecrories are highly sensirive to age), but not on the yes-no 
quesrion of whether a person lives above the poverty line. It is also worth 
noting thar age in the NLSY is restricted in ränge beeause the sample was 
all born within a few years of each other. 

12. The imaginary person is sexless. 

IT We refrain from precise numerical estimates of how much more important 
IQ is than socioeconomic hackground, for two reasons. First, they are not 
essential to the point of this discussion. Seconcl, doing so would get us inro 
Problems of measurement and measurement error that would needlessly 
complicate the text. Ir seems sufficient for our purpose note rhar IQ has 
a greater impact on the likelihood of being poor than socioeconomic hack¬ 
ground, as those variables are usually measured. 

14. The 1991 poverty rate for persons 15 and over was 1 l .9 percent, compared 
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to 22.4 percent for children under 15. U.S. Bureau of the Census, 1992 
Table I. 

15. For an analysis of the demographic reasons and so me measurement issues, 
see Smith 1989. 

16. U.S. Bureau of the Census 1992, Tahle C, p. xiv. 

17. U.S. Bureau of the Census 1992, Tahle C, p. xiv. 

18. Eggebeen and Lichter 1991; Smith 1989. 

19. Civen childless white men and women of average age, socioeeonomic 
hackground, and IQ, the expected poverty rares are only 1.6 percentage 
points apart and are exceedingly low in hoth cases: LI and 4-7 percent, re- 
spectively. 

20. The relationships of IQ to poverty were Statistical ly significant heyond rhe 
.01 level for hoth married and unmarried women. Our policy rhmughour 
the book is not rourinely to report significance statistics, hut ar the same 
time not to present any relationship as being substantively significant un- 
less we know that ir also is Statistically significant. 

21. An entire draft of the book was wrirren using a different measure of IQ. As 
de sei ihed in Appendix 5, the armed forces changed the scoring System for 
the AFQT in 1989. The first draft was written using the old Version. Af¬ 
ter discussing the merirs of the old and new measures at length, we decided 
to switch to the new one, because, for arcane reasons, it is psychomerri- 
cally superior. The substantive effects of this change on the conclusions in 
the book are, as far as we can teil, effectively nil. All of the analyses have 
also been repeated wich two versions of the SES index, and many of rhe in 
with three. Again, the three versions yielded substantively indisringuish- 
able results. But each of the successive versions of the SES index was, in 
our judgment, a theoretically more satisfying and statistically more robust 
way of capruring the construct of “socioeeonomic Status.” 

Regarding the specific analysis of the role of gender and marital Status 
in mediating the relationship between IQ and poverty: Originally, the 
analysis (and the graphie included in the text on page 188) was hased on 
married/unmarried, men/women. Then we looked more closely at women 
and their various marital situations, then at those marital situations for 
wo men with children. All of the poverty analyses were conducted with two 
measures of poverty: the official definition (represented in this book), and 
a definition hased on cash inoome obtained from sources other than 
government transfers. We decided to present the results using the official 
definition to avoid an extra layer of explanation, but we have the comfort 
of knowing that the Interpretation firs both definitions, excepr for a few 
nuances that are not important enough to warrant a place in this concise 
an account. We have conducted some of these analyses for age-restricted 
samples, to see if things change for older cohorts in ways that are not 


captured by using age as an independent variable in rhe regression equa- 
tion. Throughout all of these regression analyses, we were also looking at 
cross-tahulations and frequency distributions to try to see what gnomes 
might be lurking in the regression coefficients. Finally, we duplicated all 
of the analyses you see with and without sample weights, to ensure that 
there were no marked, mysterious differences in the two sets of results. 
There were undoubtedly other iterations and variations that we have 
forgorten over the last tour years. 

None of this will he surprising to our colleagues, for the process we have 
descrihed is SOP for social scientists engaged in complex analyses. But for 
nonspecialists, the story is worth remembering. lt should make you more 
skeptical, insofar as you under stand that such enterprises are not as elegant 
and preordained asauthors (includingus) sometimesmake it sound. But the 
story can also give you some additional confidence, insofar as, when you 
find yourself wondering whether we considcred such-and-such an alterna¬ 
tive way of looking at the data, the chances are fairly good that we did. 

22. ln passing, it just isn’t so for blacks either. The independent roles of poverty 
and socioeeonomic Status are almost exactly the same for blacks in the 
NLSY as for whites. See Chapter 14- 

C Chapter 6 

1. Kronick and Hargis 1990. 

2. For a discussion of definitional issues in measuring the dropout rate, see 
Kominski 1990. 

L Most people ger their high school degrees or equivalences later than at the 
age of 17, so the figure on page 144 implicitly overestimates the Propor¬ 
tion of dropouts in the population as a whole, ar least for recent times. In 
1985, the U.S. Government Accounting Office estimated that 13 percent 
of the population between the ages of 16 and 24 could he characterized as 
school dropouts, which amounted to 4-3 mülion people (cited by Hahn 
and Lefkowitz 1987; Kronick and Hargis 1990). Dropout: rates in some 
locales may differ markedly from the national averages. ln Boston, for 
example, dropping out of the public schools (as distinguished from losses 
due to transferring out of the school System) has recently risen ahove 45 
percent (Camayd-Freixas and Horst 1987). 

4- ln 1990, the percentage of persons ages 25 to 29 who had completed four 
years of high school or more was 85.7 percent, higher than the plotted 
“graduation rat io,” which is hased on 17-year-olds (National ("enter for 
Education 1992, Table 8). 

5. Quoted in Clignet 1974, p. 38. See Chapter 22 for additional discussion. 

6. Tildstey 1936, p. 89. 
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*6 These numbers represent an unweighted mean ot rhe six studies of ninth 
graders and the nine studies of students who were either seniors or gradu- 
ares. When sample sizes are taken into account, the (weighted) means for 
the two groups are 104.2 and 105.5 (Finch 1946, Table 1, pp. 28-29). This 
may understate the degree ofdifference between the dropout and rhe high 
school senior. Other studies indicate that within any given school, a Sta¬ 
tistical relationship existed between IQ and rhe likelihood offinishing high 
school. In urhan areas, the size of the correlation itself could he suhsran- 
tial. ln one of the best such studies, Lorge found for the city of New York 
in the 1930s that the correlation ot IQ wirh highest completed grade was 
+.66 (Lorge 1942). Some of the individual studies of specific high schools 
conducted during that period reviewed by Finch also showed larger differ- 
ences. But those studies tended to he subject to a number of technical er- 
rors. Even giving substantial weight to them, the difference between rhe 
mean IQ of the high school dropout and youths who made it to the senior 
year during the 1920s was considerably less than half a Standard deviarion 
(7.5 IQ points). Perhaps children who dropped out betöre the ninth grade 
had somewhat lower lQs, so that the Overall difference between diploma 
holders and dropouts was larger than the difference between ninth graders 
and twelfth graders. The data on this issue for the first half of the Century 
are fragmentary, however. 

8. If a third dropped out between ninth grade and twelfth grade, rheir aver¬ 
age IQ must have been 101, compared to 107 for the seniors and gradu- 
afes, if half dropped out, it must have been 103. Assuming a population 
aveiage of 100, this implies that those who dropped out prior to ninth grade 
had still lower scores than those who dropped out afterward. 

9. Iowa State Department of Public Instruction, 1965. 

10. Dillon 1949, quoted in Jensen 1980, p. 334. 

11. Rased on a comparison of the academic apritude scores of rhe ninth graders 
in the sample who had and had not graduated from high school five yeats 
later. The IQ equivalents are computed from a graduate-dropout gap of 
1.14 Standard deviations (SDs) for boys and 1.00 SDs for girls, or approx- 
imately 1.05 Sl)s overall (Wise et al. 1977, Table A-3). In the late 1960s, 
rhe Yourh in Transition study found a difference ofahout .8 SDs on the vo- 
cabulary subtest of the GATB and the Gates Reading Tests between 
dropouts and nondropours, consistent with a 1 SDdifferenee on a full-scale 
battery of tests (reconsrructed from Table 6- 1, p. 100, and Tables C- 3-7 and 
C-3-8 in Bachman er al. 1971). 

1 2. Looking at those numbers, some readers will be wondering how much those 
dropout figures represent cause and how much effect. After all, wouldn’t a 
person who stayed through high school and then took the IQ test have got- 


ren a higher score by virtue of sraying in high school / Thisquesrion ot cause 
and effect may he raised with all of the topics using the NLSY, but it is most 
ohvious for school dropout. But while age has an effect on AFQT scores 
and is always taken into account (either through age-equated scores in the 
descriptive statistics or by entering age as an independent variable in the 
regression analyses), there is no reason to think that presence in school is 
decisive. The simplest way to document this is by replicating the analyses 
for a restricted sample of youths who were age 16 and under when they 
took the test, thereby excluding almost all of the memhers of rhe sample 
who might create these artifacts. Having done so for all of the results re- 
ported in this chapter, we may report that it makes no difference in terms 
of interpretations. We will not present all of these duplicate results, but an 
example will illustrate. 

Using the full sample of whites, the mean lQs, expressed in Standard 
scores, of those who completed high school via the normal route, those 
who got a high school eqtiivalency, and those who dropped out perma¬ 
nent ly were +.37, -.14, and —.94 respectively. For whites who took the 
AFQT hefore they were age 17, the comparable means were + .34, -.04, 
and -.95. The main effect of using the age'restricted sample is drastically 
to reduee sample sizes, which we judged to be an unnecessary sacriti.cc. The 
NLSY data are consistent with other invesr.igations of this issue (e.g., 
Husen and Tuijnman 1991). Continued schooling makes a modest 
contribution to intellectual Capital but not enough to make nmcb differ' 
ence in the basic relationships linking IQ to other outcomes. Chapter 17 
specifically discusses the impact ot schooling on IQ, and Appendix .3 elab- 
orates on the relationship of schooling to IQ in rhe NLSY. 

13. Other data confirm this general picture. In the High School and Beyond 
national sample conducted by the Department of Education in 1980, it was 
found that those in the lowest quartile on the cognitive ahility test dropped 
out at a rate of 26.5 percent, compared to 14-7 percenr, 7.8 percent, and 
3.2 percent in the next three quartiles, respectively (Barro and Kolstad 
1987, Table 6.1, p. 46). Similar results have been found in other recent 
studies of dropouts and cognitive ahility (e.g., Alexander er ab 1985; Hill 
1979). Comparable rates of dropping out across the fQ categories and 
across categories defined by vocabulary test scores were also found in the 
earlier Youth in Transition study, based on approxirnately 2,000 men 
selected to be representative of the national population in the tenth grade 
in 1967 (Bachman er al. 1971). For an estimate of the Löss in cognitive 
ahility that may be attributed to dropout itself, see Alexander et al. 1985. 

14- The General Educational Development exam is administered by the 
American Council on Education. 
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15. Cameron and Heckman 1992. 

16. DES 1991, Tables 95, 97. ln the NLSY, 9.5 percent of tho.se classified as 
having a high school education got their certification rhrough the GED. 

17. As depicted in, for example, Coles 1967, in his work on certain impover- 
ished populations. The relative rolesofsocioeconomic hackground and IQ 
found in the NLSY are roughly comparnble to those found Cor the Youth 

Transition study based on students in the late 1960s, though the method 
of presentation in that study does not lend itself to a precise comparison 
(Bachman et al. 1971, Chap. 4-6). 

18. ln passing, it may be noted that these results hold true for blacks as well. 
Of the blacks in the NLSY who permanently dropped out of school, none 
was in the top quartile of IQ. Only nine-tenths of 1 percent of black per¬ 
manent dropouts were in the top half of IQ and the hottom half of SES. 
See Ohapter 14 as well. 

19. ln a logistic regression, wirh all independent variables expressed as Stan¬ 
dard scores, the coefficients for IQ, SES, Age, and the SES x IQ inrerac- 
tion term were 1.91, .98, -.06, and .32, respectively. The inrercept was 
2.81. The interaction term was significant at the .005 level, and r = .38. 
The equation is predicting “true” for a binary variable denoting high school 
graduation (with permanent dropout as the “false” state). 

20. Press accounts of the GED population suggest that the typical youngster 
in it had trouble with the routine of ordinary school and comes from un- 
commonly deprived family circumstances (e.g., Marriot 1993). 

21. Matarazzo 1972, pp. 178-180. 

22. The percentages were 68 and 2.3, respectively. 

Charter 7 

1 • The figure on page 156 also echoes some of the large macroeconomic torces 
that we Jid discuss in preceding chapters. To some extern, the pool of 
“16-19-year-olds not in school" has changed as high schools have retained 
more students longer and Colleges have recruited larger numbers of the 
brightest into College. As the pool has changed, so perhaps has the em- 
ployability of its memhers. The greater employment problems shown by 
the figure also fit in with the discussion about earnings in Chapter 4 and 
the way in which income has stagnated or fallen for those without College 
educations. For concise reviews of the empirical literature on lahor supplv 
and unemployment, see Heckman 1993; Topel 1993. Studies focused on 
young disadvantaged men include Wolpin 1992; Cogan 1982; Bluestone 
and Harrison 1988; Cohen 1973; Holzer 1986. There is, of course, a large 
hterarure devoted explicirly to blacks. See Chapters 14 and 20. 


2. We conducted parallel analyses with a sample based on the most rccenc 
year ot ohservation (hack to 1984), which enahled us to include data on 
some men who were being followed earlier but subsequently disappeared 
from the NLSY sample. The purpose was tc^> cotnpensate for a potential 
source of attririon bias, on the assumption that men who disappeared from 
the NLSY sample might be weighted to some degree toward those with the 
fewest Connections to a fixed address and (by the samc token) to the lahor 
market. The results obtained by this method were substantive ly indistin- 
guishable from the ones reported. 

T We replicated all of the analyses using the actual number of weeks out of 
the lahor force as the dependenr variable instead of a binary yes-no mea- 
sure of whether any time was spenr out of the lahor force. The relative roles 
of the independent variables were the same as in the reported analyses, 
with similar comparative magnitudes as well as the same signs and levels 
of Statistical significance. The relationship, such as it is, does not seem to 
he conccntrated among the children of the very wealfhy. 

4. A more fine-grained examination of the data reveals that absence from the 
lahor force and joh disabilities is exrraordinarily concentrated with in a lim¬ 
ited set of the lowest-status johs. Using a welLknown index of job prestige, 
the Duncan index, 46 percent of the reports of joh limitarions and 6 3 per- 
cenr of those who reported being prevented frotn working (but who were 
still listing an occupation) came from jobs scored 1 to 19 on the Duncan 
scale, which ranges from 1 to 100. A total of 975 white men in the NLSY 
listed such a job as their occupation in 1990. The five most common johs 
in this ränge, accounting for 35 percent of the total, were truck driver, au¬ 
tomobile meehanic, construetion laborer, carpenter, and janitor. Another 
299 white males working in blue-collar jobs scored 20 to 29 on the Dun- 
can scale. The five most common jobs in this ränge, accounting for 37 per- 
cent of the total, were welder, heavy equipment meehanic, other meehanic 
and repairman, hrick mason, and farmer. Another 158 white males were 
working in blue-collar jobs scored 30 to 39 on the scale. The five most com¬ 
mon johs in this ränge, accounting for 47 percent of the total, were deliv- 
ery man, plumber and pipefit ter, mach inist, sheet metal worker, and 
fireman. 

Looking over these jobs, it is not readily apparent that the lowest-rated 
jobs in terms of prestige are also the physically most dangerous or de- 
manding. Construetion work fits that description in the lowest category, 
but so does fireman, sheet metal worker, and others in the higher cate¬ 
gories. Meanwhiie, some of johs in the lowest category (e.g., truck driver, 
janitor) are not seif-evident ly more dangerous or physically demanding 
than some jobs in the higher categories. Or to put it another way: If a third 
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parry were given these fifteen job titles and told to rank them in terms of' 
potential accidents and the importance of physical Fitness, it is unlikely 
that the List would also be rank-ordered according to the job prestige in- 
dex or even that the rank ordering would have much of a positive corre- 
lation with the job prestige index. 

Instead, the index was created based on the pay and training i hat the 
jobs entail both of which would tend togive higher ratings to cognitively 
tnore demanding jobs. And so indeed it works out. Here are the mean IQ 
scores of white males in hlue-collar jobs, subdivided by groups on the 
Duncan scale, alongside the number per 1,000 who reported some form of 
job-related health limitation in 1989: 


Duncan Sode Score 
(Limited ro Blue Collar 

Mean IQ 

No. per 1,000 
with Job-Relared 

Occupations) 

Percentile 

Health Disability 

0-9 

35th 

52 

10-19 

40th 

55 

20-29 

48th 

32 

30-39 

56th 

26 

40-49 

59rh 

16 

In short, the results of the 

regression analysis 

i indicating that IQ 


important relarionship to job disability even among blue-collar jobs, and 
even afrer takingage and yearsofeducation intoaccount, are not explained 
away by the differentes in the physical risks of these occupations. The same 
condusion holds true when the analysis is conducted only for blue-eollar 
workers and the variable “years of edueation” is added to the equarion. The 
coefficient relating JQ to likelihood of disability is about four times the co- 
efftcient for years of edueation (with age as the other independent variable 
constant). Intriguingly, the opposite is true when the analysis is conducted 
just for white-collar workers: Years of edueation is important, wiping out 
any independent role for IQ. Interpreting this is difficult, both hecause 
health disability is such a rare phenomenon among white-collar workers 
and hecause IQ becomes so tightly linked to advanced edueation, which 
in tum is associated with jobs in which physical disability is virtually ir- 
i elevant (short of a srroke or other accident causing a mental impairment). 

5. Terman and Oden 1947. 

6. Hill 1980; Mayer and Treat 1977;OToole 1990; Smith and Kirkham 1982. 

7. Grossman 1976; Kiragawa and Hauser 1960. 

8. Restricnon of ränge (see Chapter 3) might also reduce the independent 
role of IQ among College graduates. 


Chapter 8 

1. For a review of the liferature about family decline, see Popenoe 1991. 

2. U.S. Bureau of the Census 1992, Tahle 5 1. 

3. Retherford 1986. 

4- Garrison 1968; James 1989. 

5. The eognitive elite Jid get married at sotnewhat older ages than others, 
and this difference will ^row as the NLSY cohort gets older. Judging from 
other data, almost all of those in the hottom half of the IQ distrihutton 
who will ever marry have already married hy 30, whereas many of that 29 
percenr untnarried in Class 1 will eventually marry, raising their mean age 
of marriage hy some unknown amount. It all of them married ar, say, age 
40, the average age at marriage would approach 30, which may be taken 
as the highest mean that the NLSY could plausihly produce as it follows 
irs sample into middle age. 

6. In his famous lifetime study of intellectually gifred children horn around 
1910, Lewis Terman found that, as of the 19 30s and 1940, highly gifred 
men eventually got married at higher rares than the national norms— 
about 84 percent, compared ro a national rate of 67 percent tor men of sim- 
ilar age. Gifted women married later than the average woman, but by their 
midGOs rhey too had higher marriage rates than the general population, 
rhough the difference was not as great as for men: 84 percent compared to 
78 percent (Terman and Oden 1947, p. 227). 

7. Cherlin 1981, Figure IG. His estimarion procedure suggesrs that the odds 
of eventual Jivorce in 1980 were 54 percent. Also see Raschke 1987. 

8. We are höre calculating odds ratios—the likelihood of marital survival di- 
vided hy the likelihood of Jivorce within the firsr tive years—from the tahle 
on page 174. The ratio of odds ratios for marital survival versus divorce Jur- 
ing the first five years of marriage was 2.7, comparing Glass I to Class V. 

9. In addition to the Standard variables (age, parental socioeconomic Status, 
and IQ), we added “date of first marriage.” We wished to add age ar first 
marriage as well, hur it was so highly correlated with the date of first mar¬ 
riage in the eruire white sample (r = +.81) that the two variables could not 
he used together. It was possible to use them together in some of the suh- 
samples we analyzed. The pattem of results was unchangcd. 

10. Different subsers of white youths, both the entire sample of those who had 
married and the suhset of those who had reached the age of 30, and the 
subset below the age of 30 all yielded similar results. 

11. E.g. Raschke 1987; Sweet and Bumpass 1987. 

1 2. Higher socioeconomic Status is also associated with a lower probabiUty of 
divorcc in the College sample, though the independent eftect of parental 
SES is much smaller than the independent eftect of IQ. Socioeconomic 
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Status had an insignificantly direct relationship with divorce for the high 
school sample. Thinking hack to the analysis of mar nage, note a curious 
contrast: IQ makes a lot of difference in whether high school graduates get 
married but not in whether they get divorced. IQ makes little difference 
in whether College graduates get married by the age of 30 but a lot of dif¬ 
ference in whether they get divorced. Why ! We have no idea. ln any case, 
embedded in this complicated set of findings are intriguing possihilities, 
which warrant a fulbscale analysis. 

13. Raschke 1987; Sweet and Bumpass 1987; Teachman et al. 1987. 

14- Even a genetic component has heen invoked to explain the fact that di¬ 
vorce runs in families. Not only do children tend to follow their parents 1 
path toward divorce, but identical twins are more correlated in their like- 
lihood of divorce than fraternal twins, a difference that offen hetrays some 
genetic. influence. McGue and Lykken 1992. 

1 5. Those living with only the fathcr did as well as those living with hoth bi- 
ological parents. 

16. See references in Raschke 1987; South 1985. 

17- Bronislaw Malinowski, Sex, Culture, and Myth (1930), quoted in Moyni- 
han 1986, p. 170. 

18. TTie production of illegitimste habies per unir population has also in- 
creased during this period, with the fastest growth occurring during the 
1970s. In the jargon, the rate of illegitimste births has increased as well as 
the ratio. The distinction hetween rate and ratio raises a technical issue 
that has plagued the discussion of iüegitimacy in recent years. Tradirion- 
ally, illegirimacy rates have heen computed by dividing che nutnher of il- 
legitimate births by the nu naher of umnarried wo men. In a period when 
marital patterns are also shiffing, this has the eftect of confoundmg tvvo 
different phenomena: the number of illegitimste births in the numerator 
of the ratio and the number of unmarried women in the denominator. To 
estimate the rate of change in the production of illegitimste children per 
unit population, it is essential to divide the number of illegitimste births 
by the entiie population (or, if one prefers, by the number of woinen of 
childheaiing age). This is almost never done, however, in nontechnical 
discussions (or in many of the technical ones, for that matter). Ru a dis- 
cussion of rhe difference this makes in interpreting rrends in illegitimacy, 
see Murray 1993. 

19. Sweet and Bumpass 1987, p. 95. In 1960, there were 73,000 never-married 
mothers hetween the ages of 18 and 34; in 1980, there were 1,022,000. 

20. Bachu 1991, Tahle I. The figures for ages 18 to 34 are interpolatcd ffom 
the published figures for ages 1 5 to 34. 

2 1. Not to mention that IQ has changed in the wrong direction to explain in- 
creasing illegitimacy (see the Flynn Effect,discussed in Chapters 13 and 15). 


22. As in the case of school dropout, one may ask whether having a baby out 
of wedlock as a teenager caused school dropout, therefore resulting in an 
artificially low IQ score. As hefore, the cleanest way to tesf the hypothe- 
sis is to select all the women who had their first bahy after they took the 
test in 1980 and repeat the analyses reported here, introducing a control 
for age at first birth. When this is done, the relationships reported con- 
tinue to apply as strongly as, and in some cases more strongly than, they 
do for the entire sample. 

A similar causa! tangle is associated with the age at first birth. Age at 
first birth is a powerful explanatory variable in a Statistical sense. lt can 
drastically change the Parameters, especially the importance of socioeco- 
nomic Status and IQ, in a regression equation. But, in the 1990s, what 
causes a girl in her teens to have a haby? Probably the same rhings that 
might cause her to have an i[legitimate haby: She grew up in a low-status 
household where having a baby young was an accepted thing to do; she is 
not very bright and gets pregnant inadvertently or because she has not 
thought through the consequenc.es; or she is poor and has a haby because 
it öfters hetter rewards than not having a haby, whether rhose rewards are 
tangible in the form of an income and apartmenr of her own through wel- 
fare, or in the form of having someone to love. And in fact all three vari¬ 
ables—parental SES, IQ, and whether she was living in poverty prior to 
rhe birth—are powerful predictors of age at first birth, explaining 36 per- 
cent of the variance. Furthermore, age at first birth cannot be a cause of 
parental SES and poverty in the year prior to birth. Empirically, it can he 
demonstrated not to be a “cause” of the AFQT score, using rhe same logic 
applied to the case of illegitimacy. 

23. RindfussetaL 1980. 

24. Abrahamse er al. 1988. The analysis is based on a sample of 13,061 girls 
who were sophomores in 1980 at the time of the High School and Beyond 
(HS&JB) haseline survey and also responded to the first follow-up ques- 
tionnaire in 1982. 

25. The exact figures, going from the bottom to the top quartile in socioeco- 
nomic. Status, are 38.7 percent, 29.7 percent, 19.9 percent, and 11.7 per- 
cent, based on weighted data, computed by the authors from the HS&B 
database. Figures reported here and on other occasions when we refer to 
the RAND study will sometimes show minor discrepancies with the pub¬ 
lished account, because Abrahamse er al. used imputed figures for certain 
variables, based on schoolwide measures, when individual data were miss- 
ing. Our calculations do not use any imputed figures. As in the RAND 
study, all results are based on weighted analyses using the HS&B popula¬ 
tion weights. 

26. For mothers of an illegitimate baby, the mean on the test of cognitive ahil- 



702 Notes to pages 186-195 


Note s to pages 197-199 703 


ity was .73 SD below the mean for all girls who had babies, and .67 St') he- 
low the mean for all white girls (mothers and nonmothers). 

27. Limiting the analysis to first births avoids a nuniber of technical Problems 
associated with differential number of ehildren per woman by cognitive 
and socioeconomic dass. Analysen based on all ehildren horn hy the 1990 
interview show essential ly the same results, however. We also conducted 
a parallel set of analyses using as the dependent variable whether the 
woman had ever given birth to a child out of wedlock (thereby adding 
women withour any ehildren at all ro the analysis). The interpretations of 
the results were not markedly different for any of the analyses presented in 
the text. 

28. We are, as usual, comparing the effects of a shift equal to ±2 SDs around 
the mean for both independent variables, cognitive ability and socioeeo- 
nomic Status. 

29. Bachu 199.3, TableJ. 

30. Bachu 1993, TableJ. 

3 l. The comparahle probabilities given parental SES Standard scores of-2 and 
+ 2 were 31 percent and 19 percent. 

32. The literature is extensive. Two recent reviews of the literature are Moffitt 
1992 and Murray 1993. See also Murray 1994. 

33. The writing on this topic is much more extensive for the black community 
than the white. See, for example, Anderson 1989; Duncan and Hoffman 
1990; Furstenberg et al. 1987; Hogan and Kitagawa 1985; Lundberg and 
Plotnick 1990; Rowe and Rodgers 1992; Teachman 1985; Moffirr 1983. 

34- For a detailed presentarion of this argument, see Murray 1986h. 

35. An analysis based nor on the dichotomous variable, poverty, hut on in- 
come had essentially the same outcome. 

36. When we repeat the analysis yet again, adding in the presence of the hio- 
logical father, these resulrs are sustained. Poverty and cognitive ahility re- 
main as important as before; the parents 1 poor socioeconomic sratus does 
not increase the chances of illegitimste babies. 

Chapter 9 

1. Louchhcim 1983, p. 175. See also Liebmann 1993. 

2. Bane and Elhvood 1983; Ellwood 1986b; Hoffman 1987. 

3. The studies are reviewed in Bendick and Canru 1978. 

4. Hopkins et al. 1987. 

5. This figure includes women not reflecred in the rable who did not go on 
AFDC within the first year after birth, received weif are at some later dato, 
but did not become chronic recipients. 

6. In all cases, we limit the analysis to women for whom we have complete 


data and whose child was born prior to January 1, 1989. We also conducted 
this analysis with another definition of short-term recipiency, limiting the 
sample to women whose ehildren had been born prior to 1986, divided into 
women who had never received welfare subsequently and women who had 
received welfare up to half ot the years that they were observed but did not 
qualify as chronic welfare recipients. The resulrs were similar to the ones 
reported in the text, with a large negative effecr of IQ and an insignificant 
role for SES. 

7. Bane and Ellwood 1983; Ellwood 1986a; Murray 1986a. 

8. Ellwood 1986a; Murray 1986a. 

9. We conducred a parallel analysis comparing chronic welfare recipients 
with all other mothers, including those who had been on welfare hut did 
not qualify as chronic. There are no important differences in interpreta- 
rion for the results of the two sets of analyses. 

10. Among all white women, only 16 percent had not gotten a high school 
diploma, and 27 percent had achieved at least a bachelor’s degree. 

11. Once again, this analysis has to he based on women with a high school 
diploma because there was no way to analyze welfare recipiency among 
white women with B.A.s. Only two white women with B.A.s in the NLSY 
had become chronic recipients. But for the high school graduates, the ef- 
fect of parental SES is modest—slightly small er than the independent ef¬ 
fecr of cognitive ahility. This pattem was generally shared among women 
who had gone on to get their GED (recail that pcople with a GED are not 
included in the high school sample). 

12. Some of the ohvious explanations are not as important as one might ex- 
pect. For example, most of the high school dropouts who hecame chronic 
welfare recipients were not poor; only 36 percent of them had been below 
the poverty line in the year before birth. Nor is it correct to nssume that 
all of them had babies out of wedlock; nearly half (46 percent) of their first 
babies had heen horn within marriage. But 70 percent of the chronic wel¬ 
fare recipients among the high school dropouts had had their first child be¬ 
fore they turned 19, which means that some very large proportion of them 
had the baby before they would normally have graduated. Among high 
school dropouts who had not had a child before their nineteenrh birthday, 
the independent relationships of IQ and socioeconomic ststus shift hack 
toward the familiär pattem, with the effects of IQ heing much larger than 
those of socioeconomic Status. 

1 3. Indeed, the teenage mothers who did nor become chronic welfare recipi- 
enrs had a slightly lower mean IQ than those who did (23d centiie versus 
26th centiie). Meanwhile, the ones who did not become welfare recipients 
at all had a fractionally higher mean socioeconomic sratus than the ones 
who did (27th centiie versus 26th). 
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14. Having a high school diploma was an. important variable in all of the analy- 
ses ot welfare, over and above the effects of either cognitive ability or so- 
cioeconomic background, and regarding either short-term or chronic 
welfare recipiency. The question is whether the high school diploma—and 
we are referring specifically to the high school diploma, not an equivalency 
degree—reflects a cause or a Symptom. Does a high school education pro- 
pare the young woman for adulthood and the world of work, therehy tend- 
ing to keep her off welfare? Or does the act of getting a high school diploma 
reflect the young woman's persistence and ability to cope that rend to keep 
her off welfare? It is an important question; unforrunately, we were unable 
to think of a way to answer it with the data we have. 

15. All are mutually exclusive groups, Criteria follow those for temporary and 
chronic welfare recipiency defined earlier. 

Chafrter 1 0 

1. Anderson 1936. 

2. See Bronfenbrenner 1958, p. 424, for a review of the literature through rhe 
mid-1950s. For a recent empirical test, see Luster et al. 1989. 

3. Kohn 1959. 

4- Kohn 1959. 

5. Kohn 1959, p. 366. 

6. Heath 1983. 

7. The study also includes “Trackton,” a black lower-class community. 

8. Heath 1982, p. 54. 

9. Heath 1981, p. 61. 

10. Heath 1982, p. 62. 

11. Heath 1982, p. 63. 

12. Gottfried 1984, p. 330. 

13. Kadushin 1988, p. 150. 

14. Drawn from Kadushin, 1988, pp. 150-151. Formally, neglect is defined hy 
one of the leading authorities, Norman Pulansky, as a Situation in which 
the caretaker “permirs the child to experience avoidahle present suffering 
and/or fails to provide one or more ingredients generally deemed essential 
for developing a person’s physical, intellectual or emotional capacities.” 
Quoted in Kadushin, p. 150. 

15. Kaplun, 1976; Smith and Adler, 1991; Steele 1987; Trickett et al. 1991. 

16. E.g., Azar et al. 1984. For a discussion of weaknesses in the state of knowh 
edge about causes and an argument for continuing to treat abuse and ne¬ 
glect separately, see Cicchetti and Rizley 1981. See also Bousha and 
Twentyman 1984; Herrenkohl etal. 1983. 

17. Some recent reviews of the evidence on causation are Hegar and Yung- 


man 1989; Polansky 1981; Zurav in 1989. The intergenerational explana- 
tion is one of the most widely known. For a review of the literature and 
some important qualifications to assumptions about intergenerational 
transmission, see Kauf man and Zigler 1987. 

18. Besharov 1991. 

19. D. Besharov and S. Besharov, quoted in Pelton 1978, p. 608. 

20. Parke and Collmer 1975. 

21. Coser 1965; Horowitz and Liebowitz 1969. 

22. Jensen and Nicholas 1984; Osborne et al. 1988. 

23. Leroy H. Pelton’s literature review is still excellent on the studies through 
the mid-1970s, as is Garbarino’s. See Garbarino and Crouter 1978; Pelton 
1978. Also see Straus and Gelles 1986; Straus et al. 1980; Trickett et al. 
1991. Unless otherwise noted, the literature review in this section is not 
restricted to whites. 

24- U.S. Department of Health and Human Services 1988; Wolfe 1985. 

25. Gil 1970. 

26. Reported in Pelton 1978. 

27. Young and Gately 1988, pp. 247, 248. 

28. Reported in Pelton 1990-1991. 

29. Klein and Stern 1971; Smith 1975. 

30. Baldwin and Oliver 1975. 

31. Cohen et al. 1966; Johnson and Morse 1968. 

32. Smith et al. 1974. 

33. Pelton 1978, pp. 612-613. 

34. Gil 1970. Recall that Chapter 6 demonstrated that cognitive ability was a 
stronger predictor of school dropout than socioeconomic Status. 

35. Brayden et al. 1992. 

36. Crittenden 1988, p. 179. 

37. Drotar and Sturm 1989. 

38. Azar et al. 1984. See Steele 1987 for Supporting evidence and Kravitz and 
Driscoll 1983 for a contrary view. 

39. Bennie 1969. 

40. Dekovic and Gerris 1992. For findings in a similar vein, see Goodnow et 
al. 1984; Keller et al. 1984; and Knight and Goodnow 1988. For studies 
concluding that parental reasoning is not related to social dass, see New- 
berger and Cook 1983. 

4L Polansky 1981, p. 43. 

42. Most tantalizing of all was a prospective study in Minnesota that gave an 
extensive hattery of tests to young, socioeconomically disadvantaged 
women before they gave birth. In follow ing up these mothers, two groups 
were identified: one consisting of thirty-eight young women with high- 
stress Ufe events and adequate care of their children (HS-AC), and the 
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other of twelve young women with high-stress life events and inadequate 
care (HS-NC). In the article, data on all the tests are presented in conv 
mendable detail, except for IQ. In the “method” section that lists all the 
tests, an IQ test is not mentioned. Subsequently, there is this passage, which 
contains everything we are told about the mentioned test: “The only pre- 
natal measure that was not given at 3 months [after birthj was the Shipley- 
Hartford IQ measure. The mean scores on this measure were 26.9 for the 
HS-AC group and 23.5 for the HS-NC group (p = .064)3’ Egeland 1980, 
p. 201. A marginally Statistical ly significant difference with samples of 12 
and 38 suggests a sizable IQ difference. 

43. Friedman and Morse 1974; Reid and Tablin 1976; Smith and Hanson 1975. 
44- Wolfe 1985. 

45. Berger 1980. 

46. Young 1964, eited by Berger 1980. 

47. Wolfe 1985, pp. 473-474. 

48. It is understandable that many survey srudies cannot obrain a measure of 
IQ. But virtually all of the srudies discussed called for extensive coopera- 
tion by the abusive parents. The addition of a short intelligence test would 
seem to have been readily feasible. 

49. The actual quotation is dense but intriguing: “Moreover, they [the British 
researchers] have shown that parental competence (defined as sensitivity 
and responsiveness to infant cues, quality of verbalization, and physical 
contact, and related skills) and adjustment (e.g., low anxiety and adequate 
flexibility) were distinguishing abilities that moderated the impacr of aver- 
sive life events” (Wolfe 1985, p. 478). 

50. Honesty of the respondents apart, the NLSY data do not address this is- 
sue. The question about drinking asked how often a woman drank but not 
how much at any one time. Since a single glass of wine or beer a few times 
a week is not known to be harmful, the drinking data are not interpretable. 

5 1. Roughly equal proportions of smokers in the low and high cognirive classes 
told the Interviewers that they had cut down during pregnancy—about 60 
percent of smokers in each case. 

52. Leonard er al. 1990; Hack and others 1991. 

53. “Low birth weight” Ls operationally defined as infants weighing less than 
5.5 pounds at birth. This definition, however, mixes children who are Gar¬ 
ried to term and are nonetheless underweight with children who are born 
premarurely (which usually occurs for reasons over which the woman has 
no control) but who are otherwise of normal weight and development, ln 
the Jargon, these babies have a weight “appropriate for gestational age” 
(AGA). Babies who weighed less than 5.5 pounds but whose weight was 
equal to or higher than the medical definition of AGA (using the Col¬ 
orado Intrauterine Growth Charts) were excluded from the analysis. 


54- The dip in the Proportion for Class V could also be an artifact of small sam¬ 
ple sizes. The proportion (computed using sample weights) is produced by 
9 out of 116 babies. Sample sizes for the other cognitive classes—II, III, 
and IV—were much larger: 573, 2,059, and 737, respecrively. 

55. Hardy and Mellis 1977. 

56. Gramer 1987. In a revealing sign of the unpopularity of intelligence as an 
explanatory variable, Gramer treats years of education as a proxy measure 
of socioeconomic Status. For other s tu dies showing the re lat ionship of ed¬ 
ucation to infant morrality, see Bross and Shapiro 1982; Keller and Fer- 
rerly 1978. 

57. This is a persistent issue in infant mortality research. There are varying 
opinions about how important the distinction between neonatal and in¬ 
fant deaths may be. See Eberstein and Parker 1984- 

58. Dune an 1993. 

59. The calculation assumes that the mother has average socioeconomic back- 
ground. 

60. It measures, among other things, the emotional and verbal responsiveness 
and involvement of rhe mother, Provision of appropriate play materials, 
variety in the daily routine, use of punishment, and organization of the 
ehilds environment. The HOME index was created and tested by Bcttye 
Caldwell and Robert Bradley (Galdwell and Bradley 1984). 

61. From Glass IV to Class 11, they were the 48rh, 60th, and 68th percentile, 
respecrively. For most of the assessments, including the HOME index, the 
NLSY database contains raw scores, standardized scores, and centile scores. 
For rechnical reasons, it is more accurate to work with standardized scores 
than percentiles when computing group means, conducring regression 
analyses, and so forth. On the other hand, cenriles are much more readily 
undersrood by the ordinary reader. We have conducred all analyses using 
standardized scores, then converted the final results as reported in the ta¬ 
bles back into centiles. Thus, the centiles in the table are ?iot those that will 
he produced by simply averaging the HOME centile scores in rhe NLSY. 

62. We replicated all of these analyses using the HOME index as a continu- 
ous variable, and the substantive conclusions from those replications are 
consistent with the ones reported here. 

63. The HOME index has different scoring for children younger than 3 years 
old, children ages 3 through 5, and children ages 6 and older. We exam- 
ined rhe HOME results for the different age groups and found that they 
could he combined without significant Löss of precision for the interpreta- 
tions we descrihe in the text. There is some evidence that the mother’s IQ 
was most important for the home environment of children ages 3 through 
5 and least important for children ages 6 and older, but the differences are 
not dramatic. 
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64. E.g., Duncan 1993 and almost anything pubtished by the Children’s De¬ 
fense Fund. 

65. We also conducted analyses treating family income as a continuous vari¬ 
able, which showed consistent results. 

66. The poverty measure is based on whether the mother was helow the 
poverty line in the year prior to the HOME assessment. Independent vari¬ 
ables were IQ, mother’s socioeconomic hackground, mother’s age, the test 
year, and the child’s age group (for scoring the HOME index). 

67. The rable on page Z22 shows the predicted odds of being in rhe bottoin 
decile on the HOME index from a regression equarion, using the child’s 
sample weights, in which the dependent variable is a binary repre- 
sentarion of whether an NLSY child had a HOME score in the botcom 
decile, and the independent variables were mother’s IQ, mother’s socioe¬ 
conomic bac.kground, mother’s age, and nominal variables representing 
the test year, the age category for scoring the HOME index, poverty in the 
calendar year prior to the administration of the HOME index, and receipt 
of AFDC in the calendar year prior to the administration of the HOME 
index. 

Odds of Being in 

Mother’s the Bottom Decile 


Mother’s 

Socioeconomic 

ln 

On 

on the HOME 

IQ 

Background 

Poverty’ 

Welfare? 

Index 

Average 

Average 

No 

No 

4% 

Average 

Average 

Yes 

No 

8% 

Average 

Average 

No 

Yes 

9% 

Average 

Average 

Yes 

Yes 

16% 

Average 

Very low 

No 

No 

7% 

Average 

Very low 

Yes 

No 

12% 

Average 

Very low 

No 

Yes 

14% 

Average 

Very low 

Yes 

Yes 

24% 

Very low 

Average 

No 

No 

10% 

Very low 

Average 

Yes 

No 

18% 

Very low 

Average 

No 

Yes 

21% 

Very low 

Average 

Yes 

Yes 

M% 


“Very low” is defined as two SDs below the mean. Poverty and welfare 
refer to rhe calendar year prior to the scoring of the HOME index. 

68. The NLSY reported scores on these indexes for infants under 1 year of age, 
not analyzed here. 


69. This Statement applies to the full white sample, ln. the cross-sectional sam¬ 
ple, used for the regression results in Appendix 4, the role of birth Status 
(legitimate or illegitimate) was not significant when entered along with 
poverty and welfare receipt. 

70. A technical note that applies to the me ans reported in the table on page 
2 30 and in Chapter 15. In applying the national norms, the NLSY declined 
to estimate scores for very low-scoring child re n not covered in the PP VT’s 
scoring tables, instead assigning them a score of zero. For purposes of Com¬ 
puting the means above and in Chapter 15, we assigned a score of 40 (four 
SDs below the mean, and the lowest score assigned in the Standard tables 
for scoring the PPVT) to all children with scores under 40. 

71. Careful readers may he wondering why white children, who have had less 
than rheir fair share of the bottom decile for most of the other indicators, 
account forfully 10 percent of all NLSY children in the bottoin decile. The 
reason is thar the women of rhe NLSY sample (all races) have had a high 
Proportion of low-IQ children, hased on rhe national norms for the 
PPVT—fully 23 percent of all NLSY children ages 6 and older when they 
rook rhe test had IQs of 80 or lower. For whites, 10 percent of the children 
who have been tested fall inte the bottom decile. This news is not quite as 
had as it looks. Just because the NLSY mothers were a national ly represen- 
tative sample of women in acertain age group does not mean that rheir chil¬ 
dren are a national ly representative sample of children. But the news is 
nonetheless worrisome, with implications that arediscussed in Chapter 15. 

72. See Chapter 4 for the discussion of heritability of IQ. 

C/tuprer /1 

1. The proportional increases in property crime tracked more or less with the 
increases in violent crime unril the late 1970s. Since then, property crime 
has moved with in a narrow ränge and in 1992 was actually lower than it 
had been ten years earlier. This divergence hetween violent and property 
crimes is in itself a potenrially significant phenomenon that has yet to be 
adequately explored. 

2. For citations of the extensive literature on this subject, see Chaiken and 
Chaiken 1983; Wilson and Hermstein 1985. The official statistics may 
have understated the increase in these “crimes that people consider seri- 
ous enough to warrant reporting to the police,” insofar as many hurglaries, 
assaulrs, and Street robheries that would have been reported in the 1950s 
(when there was a reasonable chance that the police would conducr. a gen¬ 
uine investigation) are no longer reported in urban areas, where it is taken 
for granted that they are too minor to compete for limited police resources. 

3. A more traditional way to sort the theories is to contrast classical theories, 
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which depicr crime as the rational behavior of free agents, based on costs 
and benefits, with positive theories, which look for the causes of crime in 
society or in psychological makeup (for discussion of criminoLogical 
theory, see, for example, Gottfredson and Hirschi 1990; Wilson and 
Herrnstein 1985). We are distinguishing only among positive theories, 
because the notion of criminals as rational agents seems ro fit few actual 
crimfnals and the role of costs and benefits can readily be ahsorhed hy a 
positive theory of criminal behavior (see Wilson and Hermstein 1985, 
Chap. 2). A distinction similar to ours between psychological and socio- 
logical theories is one between “psychiatric” and “criminologicai” theories 
in Wessely and Taylor 1991. 

4- Freeman 1983; Mayer and Jencks 1989; Wilson and Herrnstein 1985, 
Chaps. 11, 12. 

5. Cleckley 1964; Colaizzi 1989. 

6. Wilson and Hermstein 1985. 

7. Wilson and Herrnstein 1985. 

8. Jn fact, with in criminologicai theory, rhe distinction between heing dis - 
posed to break the law and heing disposed to obey it bas some resonance, 
as illustrated in, for example, Gottfredson and Hirschi 1990. This is a fine 
point of theory, which we cannot elaborate on here. 

9. For more extended discussion of the logic of the link between IQ and com- 
mitting crime, see Gottfredson and Hirschi 1990; Hirschi 1969; Wilson 
and Herrnstein 1985. 

10. Goring 191 3. 

11. Goddard 1914. 

12. Murchison 1926. We know now thar this was a peculiarity of a federal 
prison like Leavenworth, which had relatively few of the run-of-rhe-mill 
offenders rypical in state prisons. 

13. Sutherland 1931. 

14- Haskeil and Yahlonsky 1978, p. 268. 

15. Reid 1979, p. 156. 

16. Hirschi and Hindelang 1977. 

17. Reid 1982. 

18. A halanced, recent summary says, “At this juncture it seems reasonable ro 
conclude that the difference [between offenders and nonoffenders in in- 
telligence] isreal and notduetoanyofthepossiblemethodological orcon- 
founding factors thar have been noted in the literature” (Quay 1987 p. 
107 ff.). 

19. The gap between offenders and nonoffenders is rypically larger on verbal 
than on performance (i.e., nonverbal) intelligence tests (Wilson and 
Herrnstein 1985). It has been suggested that this is because rhe essential 


difference between offenders and nonoffenders is the difference in g; ir is 
well known that verbal scores are more dependent on g than performance 
scores (Gordon 1987; Jensen and Faulstich 1988). Another, not necessar- 
ily inconsistent, Interpretation is that verbal intelligence scores do better 
ar measuring the capacity for internalizing the prohibitions thar help de- 
ter crime in nonoffenders (Wilson and Herrnstein 1985). Multiple of- 
tenders, as distinguished from offenders in general, also have significant 
deficits in logical reasoning ability per se (Reichel and Magnusson 1988). 
Whatever the reason for these patterns of differences, the methodological 
implications are clear: The rare study thar fails to find much of an associ- 
ation between IQ and offending may have used nonverbal scores or scores 
that, for one reason or another, minimize individual differences in g. 

20. E.g., Blumsteinetal. 1985; Denno 1990. National srudies of convicts who 
get rearrested after release also show that rhose with low levels of educa- 
tion (which are presumably correlated with low test scores) are at higher 
risk for recidivism (Beck and Shipley 1989). 

21. Lipsitt et al. 1990. 

22. Reichel and Magnusson 1988. 

23. Hirschi 1969; Wilson and Herrnstein 1985. 

24- Nicholson and Kugler 1991. 

25. The evidence in fact suggests that smart offenders pick crimes with lesser 
likelihood of artest and larger payoffs (Wilson and Herrnstein 1985). 

26, Moffitt and Silva 1988; Hindelang et al. 1981; Hirschi and Hindelang 
1977; Wilson and Herrnstein 1985. 

27- Reichel and Magnusson 1988. 

28. Kandel et al. 1988. 

29. In this sample, there was no significant correlation between IQ and so- 
cioeconomic Status, and IQ remained a significant predictor of offending 
even after the effects of parental SES and the sons’ own Level oi education 
were entered as covariates in an analysis of covariance. 

30. White er al. 1989. 

31. Werner and Smith 1982. 

32. Werner 1989; Wernerand Smith 1982. 

3 k For an entry into this literature, see Farrington and West 1990; Gortfred- 
son and Hirschi 1990; Mednick and others 1987; Wilson and Herrnstein 
1985. 

34- In this regard, it is perhaps worth mentioning thar we original ly intended 
for this book to be about individual differences general ly and social polic.y, 
with intelligence as the centerpiece. We narrowed the focus to intelligence 
partly because it looms so much larger than any other individual trait in 
explaining what is going on, but also out of necessity: Only for criminal 
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behavior is the scientific iiterature extensive enough to have permitted a 
thoroughgoing presentation of individual differences other than intellec- 
tual. 

35. The most serious prob lern is the established and pronounced tendency of 
black juveniles to underreport offenses (Hindelang 1978, 1981). 

36. Not surprisingly, the most serious offenders are the ones who most often 
underreport their crimes. Serious offenders are also the ones most likely to 
go uninterviewed in survey research. Ar the other extreme, minor offend¬ 
ers brag about their criminal exploits. They inflate the real level of “crime” 
by putting minor incidents (for example, a schooUyard fistfight, which can 
easily fit the technical definition of “aggravated assault”) in the same car- 
egory with authentically felonious attacks. 

Since we are focusing on the role of intelligence, self-report data pose 
a special problem, for it has been observed that people of low intelligence 
are less candid than brigbter respondents, This bias would tcnd to weaken 
the correlarion between IQ and crime in self-report data. 

37. The authoritative source on self-report data for juveniles is still Hindelang 
et al. 1981. See also Hindelang 1978, 1981; Smith and Davidson 1986. 

38. Wolfang, Figlio, and Sellin 1972; Wilson and Herrnstein 1985. 

39. These results for the entire age ränge are substantially the same when age 
subgroups are examined, but so me differences may he found. Th ose who 
hecome involved with the criminal justice System at an early age tended 
to have lower intelligence than those who first become involved later in 
their teens. 

40. This represents the top decile of white mal es. To use the same index across 
racial groups is inadvisable because of the different reporting characteris- 
tics of whites and blacks. 

41. For a review of the Iiterature, see Wilson and Herrnstein 1985. 

42. Elliott and Voss 1974. 

43. Thornberry et al. 1985 uses the Philadelphia Co hört Study to de mons träte 
rising crime after dropout for that well-known sample. 

44- The sample includes those who got a GED—most of whom had gotten it 
at the correctional Institution in which they were incarcerated at the time 
of their interview. The results are shown in Appendix 4. 

Chapter 12 

1* Gove 1964- The definition is listed, sadly, as “obsolete.” We can think of 
no modern word doing that semantic job now. 

2. More recently, Walter Lippmann used cmzfiry in his worrying hook (Lipp- 
mann 1955) about what he feared was disappearing with the rising 
U J acobinism” of American political life, the shift he saw early in the Century 


away from representative government toward populist democracy. Early in 
his career as a journalist and social commentator (Lippmann 1922b), Lipp¬ 
mann noted that the ordinary, private person sets the concems of 
governance very low on his or her list of priorities. To govern us, he said, 
we needed a special breed of person, leaders with the capacity to fathom, 
and the desire to promote, the public good. That capacity is what he called 
civility. For a reflection on Lippmann’s conception of civility by a social 
scientist, see Burdick 1959. 

3. The re are other rationales for not voting, as, for example, the o ne pro- 
moted on a T-shirt favored by libertarians: “Don’t vote. It only encourages 
them.” 

4- For an attempt to construe voting as a rational act from the economic 
standpoint, see Downs 1957. 

5. Aristotle 1905 ed., p. 1129. 

6. Although the sample was not strictly representative of the American pop¬ 
ulär ion , it was a broad eross-section, unlikely to be atypical except as a re- 
sult of its underrepresentation of rural and minority children. Hess and 
Torney 1967. 

7. The second graders were excluded from so me of the analyses because so me 
quesrionnaire items evoked too high a rate of meaninglessor nonresponses. 

8. A measure of political efficacy was based on the children’s “agree” or “dis- 
agree” responses to five Statements, including: “I don’t think public offi- 
cials care much what people like me think.” Or, “People like me don’t have 
any say about what the government does.” 

9. Harvey and Harvey 1970. 

10. The exceptions included the measures for political efficacy and political 
participation, both of which were bare ly cor related with intelligence, al¬ 
though slightly correlated with socioeconomic sratus (primärily via 
parental education, rather than family wealrh). The authors speculated 
that the rising cynicism of the young during the later 1960s may in part ac- 
count for these deviant results. 

11. Like other studies (e.g., Neuman 1986, see helow), this one also found that 
the more intelligent someone is, the more likely he or she is to be liberal 
on social issues and conservative on economic ones. Chauvinistic, mili- 
tarisric, and anricoinmunistic attitude were inversely related to intelli- 
gence. 

12. For a brief summary of this Iiterature as of the late 1960s, see White 1969, 
who simUarly concludes that political socialization, as he calls it, is highly 
dependent on intelligence itself rather than on socioeconomic sratus. 

13. Sidney Verba and Norman Nie (1972), leading scholars of American vor- 
ing, distinguish cogently between the study of politics as a political seien- 




714 Notes to pages 258-260 


Notes to pages 260-261 715 


tist approaches ir and political psychology. A political sei ent ist mostly 
wants to und erstand how political participation shapes the choices a Com¬ 
munity makes; a political psychologist tries to und erstand the participa- 
tion itself. This chapter comes closer to political psychology than to 
political science. 

14- Campbell et aL 1960; Milbrath and Goel 1977; Verba and Nie 1972; 
Wolfinger and Rosenstone 1980. 

15. Wolfinger and Rosenstone 1980, p. 1 'S. 

16. Verba and Nie 1972. 

17. The one exception, the frequency with which an individual contacted po¬ 
litical off icials for matters of personal concern, showed no such correla- 
tion, hut it isalso the most amhiguously political. See Vetha and Nie 1972. 

18. There are hints, however, rhar, if socioeconomic Status had been broken 
into components of educational level and income, educational level would 
have predicted political participation better than income. See Figures 6-1 
to 6-3 in Verba and Nie 1972. 

19. Wolfinger and Rosenstone 1980. In even-numbered years, the CPS, a sur- 
vey conducted monthly of a nationally representative sample of tens of 
thousands of Americans, asks about voting in the November elecrion. 
These surveys also include data on income, occupation, education, and 
orher personal and regional variables. The Wolfinger and Rosenstone 
analysis was based on the entire sample of aimost 100,000 respondents in 
the November surveys in 1972 and 1974 and a random subsample used for 
more detail cd modeling. The main rechnique they used is the probit analy¬ 
sis, a form of multivariate analysis for estimaring the changes in probabil- 
ity of some dependent variable—voting, in rhis case—associated wirb a 
change in an independent variable—educational attainment, for exam- 
ple—after the effects of the other variables—say, income or occuparional 
level—are taken account of. 

20. E.g., Peterson 1990. 

21. Neuman 1986. This hook aggregares data from nine studies of voring he- 
rween 1948 and 1980 and comes up with a measure of “political sophisti- 
cation, ’ which seems tohave considerable power in explaining much ahout 
voting, including simple tumout. The “key causal facror” for political so- 
phistication, Neuman found, is education, which explained four times as 
much of the variance in sophistication as the next most intluenrial facror 
in a 1 ist rhar included age, race, sex, the other components of socioeco¬ 
nomic Status, parental behavior, and region of the eounrry. 

22. Wolfinger and Rosenstone 1980, p. 19. 

23. Besides the works already ciced, for other overviews coming to the same 
Basic conclusion, see Campbell et al. 1960; Milbrath and Goel 1977; Neu¬ 
man 1986. 


24. “It is difficult to find Support in our data for notions that a generic Status 
variable plays any part in the motivational foundations of the decision ro 
vore” (Wolfinger and Rosenstone 1980, p. 35). Perhaps there is some ef- 
fect of income on voting at the lowest levels hut throughout the ränge of 
income, it seems to have no independent predictive value of its own. 

25. Verba and Nie 1972, p. 335. 

26. How someone vor es, rarher than whether, can he more plausibly connected 
to the out ward benefits gained from rhe ourcome of an elecrion. And many 
political scientists focus more on political preference than on level of en- 
gagement. Political preferences, too, have rheir individual correlates, hut 
we will not try to summarize these results as well (hut see, for example, 
Fletcher and Forbes, 1990; Granherg and Holmberg 1990; Milbrath 1977; 
Neuman 1986; Nie et al. 1976). 

27. There is an indirect argument to be made by combining four observations: 
(1) We know for sure that one of the traits roughly measured hy educa¬ 
tional attainment is intelligence. (2) As we showed in Chapter 1, Ameri¬ 
can educational opportunities are more effieiently distributed by cognitive 
ahility than they have ever been, here or elsewhere. (3) lt is here and now 
that we see the strongest correlations between voting and educational at- 
tainmenr. (4) In countries where education and cognitive ahility are not 
so thoroughly enmeshed, education has less impact on voting. To fi 11 in the 
story: Düring the 1950s and 1960s, rhe level of political participation rose 
more rapid ly rhan the educational level of the population (Verba and Nie 
1972, p. 252). Looking hackward, we see the other side of the same coin. 
ln 1870, only 2 percenr of the American population had finished high 
school; even fewer were going to College. Yer voting rates may have been 
higher rhan they are now. Kleppner (1982) concludes that voting rates 
were more than 11 percentage points above where they should have been, 
had education had the same effects in the 1880s that they had in 1968. 
Shortridge (1981) has a lower estimate of voter turnout in the late nine- 
teenth Century, hut still one that exceeds expectations, given the educa¬ 
tional levels of the period. Proper historical comparisons must, of course, 
take into account changes in voting laws, in polt taxes, in regisrration 
requirements, as well as the effects of the extension of suffrage to women 
and to 18- to 20-year olds, However, after all those corrections are made, 
scholars agree that past voting rates (post-Civil War, nineteenth Century, 
for example) are incommensurately high or present rates are inconv 
mensurately low, given the changes in levels of formal education of the 
general public. Except in the South of the Reconstruction, the correlation 
between education and voting rate was negative from 1876 ro 1892, just 
the reverse of what it is now (see Kleppner 1982). The international data 
indicating that education is less important in voting where education is 
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not so enmeshed with cognitive ability come from Milbrath and üoel 

(1977). 

28. Exposure to political print media was another influential factor, but this, 
too, tumed out to he most strongly associated with rated intelligence (see 
Luskin 1990). 

29. Theso-called Bay Area Survey, described in Neuinan 1981, 1986. 

30. Seenote21. 

31. Neuman 1986, p. 117. 

32. Useful summaries can be found in Abramson and Claggett 1991; Hill and 
Lutrbeg 1983; Kleppner 1982; Peterson 1990; Rothenberg and Licht 1982. 

33. E.g., Milbrath and Goel 1977. Biological and social scientists have lately 
tried to enrich our understanding of “political man” by showing the links 
to social behavior in other species. For background to the huge literature 
on the variety of influences on political behavior and attirüdes, see Con- 
verse 1964; Kinder and Sears 1985; Rokeach 1973. 

34. Harvey and Harvey 1970. 

35. Neuman 1986. 

36. Luskin 1990. 

Chapter 13 

1. For a useful recent critique of the treatment of race hy psychologists, also 
demonstraring how difficult (impossible?) it is to be detached about this 
issue, see Yee et al. 1993, 

2. Lynn 1991c. 

3. Lynn 1987a. For a critique of Lynn’s early work, see Stevenson and Azuma 
1983. 

4. For those who want to reconstruct the debate, Lynn’s 1987 and 1991 re- 
view articles followed on earlier studies: Lynn 1977, 1978, 1982; Lynn and 
Hampson 1986b. For his response to Flynn’s 1987 critique, see Lynn 1987b. 

5. Chan and Vernon 1988. 

6. Lynn and Song 1994. 

7. lwawaki and Vernon 1988; Vernon 1982. 

8. Flynn 1991; Sue and Okazaki 1990. 

9. Flynn 1991. 

10. Lynn 1993 b. 

11. Lynn 1987a, 1987b, 1989, 1990a, 1990h, 1991h, 1991c, 1992, 1993a, 
1993b; Lynn and Hatton 1990; Lynn, Pagliari, and Chan 1988. 

12. Lynn, Hampson, and lwawaki 1987. 

13. Lynn 1991c. 

14. Stevensonetal. 1985. 


1 5. Lynn 1991a, p. 733. Lynn has noted that the mean white IQ in Minnesota 
is approximately 105, well above the average for the American white Pop¬ 
ulation. On the other hand, it is possiblc that the cities chosen in Japan 
and Taiwan were similarly elevated. 

16. An excellent account of the literature may be found in Storfer 1990, pp. 
314-321, from which our generalizations are taken. For Jews in Britain, see 
also Lynn 1992. 

17. Storfer 1990, pp. 321-323. 

18. As reported in Jensen 1984b, p. 479. 

19. Sattler 1988. 

20. A detailed and comprehensive review ot the literature through 1980 may 
be found in Osborne and McGurk 1982; Shuey 1966. For an excellent one- 
volume synthesis and analysis, see Loehlin, Lindzey, and Spuhler 1975. 

21. Standard deviations are explained in Appendix 1. 

22. To qualify, all studies had to report data for hoth a white and black sam¬ 
ple, with a sample size of at least fifty in each group, drawn from compara- 
ble populations that purported to be representative of the general 
Population of that age and geographic area (studies of special populations 
such as delinquents were excluded). Socioeconomic Status posed a special 
problem. It a study explicitly matched subjects by SES, it was excluded. If 
itsimply drew irssamplesfroma low-SES area, it was included, even though 
some degree of matching had occurred. The study had to use a Standard- 
ized fest of cognitive ability, al though not all of the in were IQ tests and not 
all included a complete battery. lf the scores were reported as lQs, a Stan¬ 
dard deviation of 1.5 was imputed if no Standard deviations for that sample 
were given. 

23. To get the IQ equivalent of SD differences, multiply the SD difference by 
15; hence, 1.08 X 15 = 16.2 IQ points. 

24. This figure is hased on non-Latino whites. The difference between blacks 

and the combined white-Latino sample in the NLSY is 1.12 SDs. Because 
the U.S. Latino population was proportional ly very small until the 1970s, 
the NLSY figure for non-Latino whites is more comparable to the earlier 
tests, in terms of definition of the sample, than the figure for the combined 
white-Latino sample, and we shall use it exclusively in discussions of the 
NLSY data throughout the chapter. ___________ 

25. The formula is a Diff. = (x„ - xj/^N.al + N,c;)/(N„. + N h ) , where N 
is the sample size, X is the sample mean, o is the Standard deviation, and 
w and h stand for white and black, respectively (taken from Jensen and 
Reynolds 1982, p. 425). Note that our white sample differs from the one 
used in Office of the Assistant Secretary of Defense (Manpower) (1982). 
The “white” sample in that report included all persons not identified as 
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Hispanic or black, whereas our “white” sample also excluded persons iden- 
tifying themselves as American Indians or a member of an Asian or Pacific 
ethnic group, The NLSY and the AFQT are described in the Introduction 
to Part II and Appendix 2. 

26. This is a very rough estimate. As of 1994 there were approximately 32.8 
million blacks in America. If the estimate is computed based on the mean 
IQ (86.7) and Standard deviation (12.4) of blacks in the NLSY, a table of 
the normal distribution indicates that only about 0.1 percent, or about 
33,000, would have IQs of 125 or higher. If one applies the observed distri- 
bution in the NLSY and asks what proportion of blacks are in the top five 
percent of the AFQT distribution (roughly corresponding to an IQ of 125), 
the result, 0.4 percent, implies that the answer is about 131,000. There are 
reasons to think that both estimates err in different directions. We com- 
promised with 100,000. 

27. For example, no external evidence for bias has turned up with the WISC, 
WAIS, Stanford-Binet, Iowa Test of Educational Development, California 
Achievement Test, SAT, ACT, GRE, LSAT, MCAT, Wonderlic Person¬ 
ne 1 Test, GATB, and ASVAB (including the AFQT in particular). 

28. If any bias has been found, it is primarily regarding performance in school, 
and it shows that test scores for blacks often “overpredict” performance; 
that is, the tests are biased “in favor” of blacks, tending for unknown rea¬ 
sons to predict higher performance than is actually observed. See Appen¬ 
dix 5 for details. 

29. Weiss 1987, p. 121. A separate argument, made in Zoref and Williams 
(1980), adduced evidence that verbal items in IQ tests are disproportion- 
ately based on white males “in role-stereotyped representations.” The au- 
thors do not present evidence that performance on these items varies by 
race or gender in ways that would indicate bias but rather indict the tests 
as a whole on the basis of their sexism and racism. 

30. The reason why the “oarsman:regatta” example has been used so often in 
descriptions of cultural bias is that it is one of the few items in the SAT 
that looks so obviously guilty. Perhaps if a test consisted exclusively of items 
that were equivalent to the example, it would be possible to demonsträte 
cultural bias statistically, but no modern test has more than a few that come 
close to “oarsman:regatta.” 

31. The definitive assessment of internal evidence of bias is in Jensen 1980. 

32. E.g., Valencia and Rankin 1988; Munford and Munoz 1980. 

33. For a review, see Jensen 1980. 

34* The NLSY has higher scores for whites than blacks on backward digit span 
and virtually no difference at all for forward digit span. In a similar way, 
SES differences within races are also greater for backward digit tests than 
forward digit tests (Jensen and Figueroa 1975). 


35. Gordon 1984. See Faneil 1983, and the attached responses, for an attempt 
to explain the difference in digit span results through cultural bias hy- 
potheses. 

36. Another commonly used apparatus involves a home button and a pair of 
other buttons, for yes and no, in response to tasks presented by a Computer 
console, The results from both types of apparatus are congruent. 

37. The literature is extensive, and we are bypassing which aspect of reaction 
time in fact covaries with g. For our purposes, it is only necessary that some 
aspects do so. For some of the issues, see, for example, Barrett, Eysenck, 
and Lucking 1986; Matthews and Dorn 1989; Vernon 1983; Vemon et al. 
1985. 

38. Jensen and Munro 1979. 

39. Jensen 1993b. 

40. The dependent variable is age-equated IQ score, and the independent vari¬ 
ables are a binary variable for race (white or black) and the parental SES 
index. The difference between the resulting predicted IQs is divided by the 
pooled weighted Standard deviation. 

41. Among the young women in the RAND study of adolescent pregnancy de¬ 
scribed in Chapter 8 (Abrahamse et al, 1988), drawn from the nationally 
representative Fligh School and Beyond sample, the same procedure re- 
duced the B/W difference by 32 percent. See also Jensen and Reynolds 
1982 and Jensen and Figueroa 1975. 

42. For some people, Controlling for Status is a tacit way of isolating the ge- 
netic difference between the races. This logic is as fallacious as the logic 
behind Controlling for SES that ignores the ways in which IQ helps deter- 
mine socioeconomic Status. See later in the chapter for our views on ge- 
netics and the B/W difference. 

43. In other major studies the B/W difference continues to widen even at the 
highest SES levels. In 1975, for example, Jensen and Figueroa (1975) ob- 
tained full-scale VTSC IQ scores for 622 whites and 622 blacks, ages 5 to 
12, from a random sample of ninety-eight California school districts. They 
broke down the scores into ten categories of SES, using Duncan s index of 
socioeconomic prestige based on occupation. They found a B/W discrep- 
ancy that went from a mere .13 SD in the lowest SES decile up to 1.20 SD 
in the highest SES decile, Going to the opposite type of test data, the 
Scholastic Aptitude Test taken by millions, selhselected with a bias toward 
the upper end of the cognitive distribution, the same pattern emerged. In 
1991, to take a typical year, the B/W difference among students whose par- 
ents had less than a high school diploma was .58 SD (averaging verbal and 
mathematical scores), while the B/W difference among students whose 
parents had a graduate degree was .78 SD. (National Ethnic/Sex Data for 
1991, unpublished data available by request from the College Board). In 
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their separate veviews of the literature, Audrey Shuey (whose review was 
pubiished in 1966) and John Loehtin and his colleagues (review published 
in 1975) identified thirteen studies conducted from 1948 rhrough the early 
1970s that presented IQ means for low- and higlvSES groups by raee, In 
twelve of the thirteen studies, rhe black-whire difference in IQ was higher 
for the higher-SES group than for rhe lowenSES group. For similar results 
for the 1981 Standardization of the WAIS-R, see Reynolds et al. 1987. A 
final com ment is that rhe NLSY also shows an increasing B/W difference 
at the upper end of the socioeconomic scale when the 1980 AFQT scor- 
ing System is used and the scores are not corrected for skew. See Appendix 
2 for a discussion of the scoring issues. 

44- Kendall, Verster, and Mollendorf' 1988. 

45. Kendall, Verster, and Mollendorf 1988. For another example, rhis time of 
an entire book devoted to testing in the African setring that fails to mein 
tion a single mean, see Schwarz and Krug 1972. 

46. Lyon 1991c. 

47. Boissiere et ab 1985. 

48. Owen 1992. 

49. Reynolds et al. 1987. 

50. Vincent 1991. 

51. Vincenr also cites rwo nonnormarive studies of chiLiren in which rhe B/W 
differences ranged from only one to nine points. These are the differences 
after Controlling for SES, which, as we explain in the text, shrinks the B/W 
gap by about one-third. 

52. Jensen 1984a; Jensen and Naglien 1987; Naglien 1986. They poinr out 
that the K^ABC lest is less saturated with g than a convemional IQ mea- 
sure and more Jependent on memory, both of which would tend to reduce 
the B/W difference (Naglieri and Bardos 1987). 

53. Jensen 1993h. 

54. Based on the white and black SDs for 1980, rhe first year that Standard de-- 
viarions by race were published. 

55. Wainerl988. 

56. Our reasons for conciuding that the narrowing of the B/W differences on 
the SAT was real, despite the potential artifacts involved in SAT score, are 
as follows. Regarding the selhselection prohlem, rhe key consideration is 
that the proportion of blacks taking the test rose throughout the 
1976-1993 period (including the suhperiod 1980-1993). ln 1976, Nacks 
whotook the SAT represented 10 percent of black 17-year-olds; in 1980, 
the proportion had risen to 13 percent; by 1993, it had risen to abour 20 
percent. While this does not necessarily mean that blacks taking the SAT 
were coming from lower socioeconomic groups (the data on parental edie 


catiun and income from 1980 to 1993 indicate they were not), the pool 
probably became less selective insofar as it drew from lower portions of the 
ahility distribution, The improvement in black scores is therefore more 
likely to be understated by the SAT data than exaggerated. 

Howard Wainer (1988) has argued that changes in black test scores are 
uninterpretable hecause of anomalies that could he inferred from the test 
scores of students who did not disclose their ethnicity on the SAT hack" 
ground questionnaire (nonresponders). Apart from several technical ques~ 
tions about Wainer’s conclusions that arise from his presentation, the key 
point is that the nonresponder population has diminished substantially. As 
it has diminished, there are no signs that the story told by rhe SAT is chang- 
ing. The basic shape of the falling trendline for the black-white difference 
cannot plausibly be affected hy nonresponders (though the true means in 
any given year might weil be somewhat different from the means based on 
those who identify their ethnicity). 

57. The ränge of .15 to .25 SD takes the data in both the text and Appendix 
5 inro account. To calculate the narrowing in IQ terms, we need to esth 
mate rhe correlation hetween IQ and the various measures of educational 
preparation. A lower correlation would shrink the estimate of the amount 
of IQ narrowing hetween blacks and whites, and vice versa for a higher es^ 
timate. The two~ to three-point estimate in the text assumes that this cor- 
relation is somewhere hetween .6 and .8. If we instead re ly entire ly on the 
SAT data and consider it to be a measure of intelligence per se, then the 
narrowing has been four points in IQ, bin only for the population that ac~ 
tually takes the test. 

58. A change of one IQ point in a generation for genetic reasons is not out of 
the realm of possihility, given sufficient differential fertility. However, the 
evidence on differential fertility (see Chapter 15) implies not a shrinking 
black-white gap hur a growing one. 

59. Jaynes and Williams 1989; Jencks and Peterson 1991. 

60. Linear extrapolations are not to be taken seriously in these situations. A 
linear continuation of the black and white SAT trends from 1980 to 1990 
would bring a convergence with the white mean in the year in 2035 on 
the Verbal and 2053 on the Math. And when it occurs, racial differences 
would not be ended, for if we apply the same logic to the Asian scores, in 
that year of 2053 when blacks and whites both have a mean of 555 on the 
Math test, the Asian mean would be 632. The Asian Verbal mean (again, 
based on 1980-1990) would be 510 in the year 2053, forty^seven points 
ahead of the white mean. But—such is the logic of linear extrapolations 
from a short time period—the black Verbal score would hy that time have 
surpassed the white mean by thirty-seven points and would be 500, only 
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ten points behind rhe Asians. ln 2069, the black Verbal mean would sur- 
pass the Asian Verbal mean. Linear trends over shorr periods of rime can- 
not he sensibly extrapulated much into the future, notwithstanding how 
often one sees such extrapolations in the media. 

61. See Appendix 5 for ACT results. ln short, the mean rose from 16.2 to in 
1986 to 17.1 in 1993. The number of black ACT students also continued 
to rise during this period, suggesring that the increase after 1986 was not 
the result of a more selective pool. 

62. Chapter 18 explores this line of rhuught turther. 

63. SAT trends are subject to a variety of quesrions relating to the ehanging 
nature of the SA l pool. The discussion that follows is based on unreported 
analyses checking out the possibility that the results reflecr these poren- 
tia l artifacts (e.g., changes in the Proportion of Asians using English as their 
first language; changes in the proportion of students coming f roin homes 
where the parents did not go to College). The discussion of these matrers 
may be found in Chapter 18. 

64- The first year for which a frequency distribution ofscores by ethniciry bas 
been published is 1980. 

65. Trying to predict trends on the basis of equivalent percentage changes from 
different baselines is a treacherous proposition. A comparison with black 
and Asian gains makes the point. For example, the percentage of hlacks 
scoring in the 700s on the SAT-Verbal grew by 23 percent from 1980 to 
1990, within a percentage point of the Asian proportional increase. For 
students scoring in the 600s, the black increase was 37 percent, not für be¬ 
it >w the Asian increase of 48 percent. The difficulty with using proportions 
in this instance is that the baselines are so different. Take rhe ease of stu- 
dents scoring in rhe 600s on the SAT-V, for example. The proportions that 
produced that 37 percent increase for hlacks were eleven students out of a 
thousand in 1980 versus fifteen students out of a thousand in 1990. The 
Asian change, put in the same metric, was from fifty-five studenrs in 1980 
to eighty-une studenrs in 1990. For every four students per thousand that 
blacks gained in the 600 group, Asians gained rwenty-six per thousand. 

66. This Statement is based on a calculation that assumes that the 1980 dis¬ 
tribution of scores remained the same except for the categories of internst. 
To illustrate, in 1980, 19.8 percent of black students scored from 200 to 
249. In 1993, only 13.1 percent scored in that ränge. Suppose that we treat 
the percentage distribution for 1980 as if it consisted of 1,000 students. ln 
that year, 198 of those students scored in the 200 to 249 ränge. We then 
recompute rhe mean for the 1980 distribution, substituting 128 for 198 in 
the 200 to 249 point category (assigning midpoinr values to all the inter- 
vals to reach a grouped mean), so in effect we are calculating a mean for a 
fictitiüUS Population of 1000 - 198 + 128 = 930. (The arfiial raln.latinn* 


used unroundeci proportions hased on the actual frequencies in each in- 
terval.) 

A technical note for those who might wish to reproduce this analysis: 
When means are computed from grouped data, the midpoinr of an inter- 
val is not necessarily the acrual mean of people in that interval, usually be- 
cause more than 50 percent of the scores will tend to be found in the fatter 
pan of the distribution covered by the interval hut also hecause scores may 
be bunched at rhe extreme categories. In the SAT-Math, for example, a 
disproporrionate number of the people in the interval from 750 to 800 have 
scores of 800 and of those in the interval from 200 to 249 have scores of 
200 (hecause they guessed wrong so often that their score is driven down 
to the minimum). Such effects can produce a noriceable bias in the esti- 
mated mean. For example, the actual verbal mean of black students in 1980 
was 330. lf one eomputes the mean hased on rhe distribution published an- 
nually by the College Board, which run in fifty-point intervals from 200 
to 800, the result is 336.4- The actual mean in 1990 was 352; the grouped 
mean is 357.9. The computed figure in the text is hased on the Surrogate 
mean as described above compared to tbe grouped 1980 and 1990 means, 
to provide a ch insistent framework. 

67. The contrast with the Asian experience on rhe SATs is striking. The Asian 
Math mean rose from 509 to 535. Of this increase, nt me of it was due to 
decreases in students scoring less than 200 (compared to 22 percent for 
blacks), while a remarkable 54 percent was due to gains in the 700 and up 
group (compared to 3 percent for blacks). Meanwhile, on the Verbal test, 
rhe Asian mean rose from 596 to 415 from 1980 to 1993. Of this, only 17 
percent occurred hecause of reducrions in Asians scoring in the 200s (com¬ 
pared to 51 percent. for blacks), while 9 percent occurred hecause of in- 
creases in Asians scoring in the 700s (compared to 0.4 percent for blacks). 
The Asian increase in test scores has heen driven by improvemenfs among 
rhe best students, while the black increase has heen driven by improve- 
ments among the worst students. We are unahle to find any artifacts in the 
ehanging nature of the black and Asian SAT pools that would explain 
these results. The continued Asian improvement makes it diffieult to 
Maine the slowdown in black improvement in the last deeade on events 
that somehow made it impossible for any American students to make 
progress. Explanations could be advanced based on events specific to 
blacks. 

68. Snyderman and Rothman 1988. The sample was based on random selec- 
tions from the Members and Fellows of the American Educational Re¬ 
search Association, National Council on Measurement in Education, six 
divisions of the American Psychological Association (Developmental Psy¬ 
chologe Educational Psychology, Evaluation and Measurement, School 
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Psychology, Counseling Psychology, and Industrial and Organizational 
Psychology), rhe Behavior Genetics Association, the Cognitive Science 
Society, and the education division of the American Sociological Associ¬ 
ation. 

69. Brody 1992, p. 309. 

70. Gould 1984, pp. 26-27. 

71. Gould 1984, p. 32. See Lewontin, Rose, and Kamin 1984, p. 127, for a sinn 
ilar argument. 

72. Gould 1984, p. 33. 

73. The ramitications for public policy are dealt with in detail in Chaprers 19 
and 20, conceming affirmative action. 

74. We do not include in the text any discussion of Phillipe Rushtons intensely 
controversial writings on the differences among Asian, white, and black 
popularions. For a brief account, see Appendix 5. 

75. A situilar example can he found in Lewontin 1970, one of the most our- 
spoken critics of the IQ enterprise in all its manifesrations. 

76. The calculation proceeds as follows: The Standard deviation of IQ being 
15, the variance is therefore 225. We are stipulating thar environment ac- 
counrs for .4 of rhe variance, which equals 90. The Standard deviation of 
the distribution of the environmental component of IQ is the square root 
of 90, or 9.49. The difference hetween group environments necessary to 
produce a fifteeivpoint difference in group means is 15/9.49, or 1.58, and 
the difference necessary to produce a three-poinr difference is 3/9.49, or 
.32. The comparahle figures if heritahility is assigned the lower hound value 
of .4 are 1.28 and .26. If heritahility is assigned the upper-bound value of 
.8, rhen the comparahle figures are 2,24 and .45. 

77. Stevenson er ab 1985. 

78. Lynn 1987a. 

79. Frydraan and Lynn 1989. 

80. Iwawaki and Vernon 1988; McShane and Berry 1988. 

81. Vernon, 1982 p. 28. It has heen argued that the 110 figure is too high, but 
a verbal-visuospatial difference among Asian Americans is not dispuied 
(Flynn 1989). 

82. Supplemental evidence has been found among Chinese students living in 
China who were given the SAT Several hundred Chinese students in 
Shanghai hetween the ages of 11 and 14 scored exrremely high on the Math 
SAT, despire an almost total lack of familiarity with American cognitive 
abiiity testing. As a proportion of the total population, this represenred a 
far greater density of high math scorers in Shanghai than in the United 
States. Further attempts to find high scorers in Chinese schools confirmed 
the original results in Shanghai (Stanley, Feng, and Zhu 1989). 

83. The SAT data actually provide even rnore of a hint about genetic origins 


for the test-score pattern, though a speculative one. The College Board re- 
ports scores for persons whose first language leamed is English and for those 
whose first language is “English and another.” Ir is plausible to assume that 
Asian students whose only “first language“ was English contain a dispro- 
portionate number of children of mixed parentage, usually Asian and 
white, compared to those in whose homes hoch English and an Asian lan- 
guage were spoken from hirth. With rhat hypothesis in mind, consider that 
the discrepancy hetween the Verbal and Math SATs was (in IQ points) 
only 1.7 points for the “English only“ Asians and 5.3 points for the “Eng¬ 
lish and another” first-language Asians. Nongenetic explanations are 
available. For example, one may hypothesize that although English and an- 
other language were hoch “first languages,“ English vvasn’t leamed as well 
in those homes; hence the Verbal scores for the “English and another” 
homes were lower. But then one must also explain why the Math scores of 
the “English and another“ Asians were twenty-one SAT points higher than 
the "English-only” homes. Here one eould hypothesize that the “English- 
only” Asians were secoruT and third-generation Americans, more assimi- 
lated, and therefore didn’t srudy math as hard as their less assimilated 
friends (although somehow they did quite well in the Verbal tesr). But 
while alternative hypotheses are available, the consistency with a genetic 
explanarion suggests that it would he instructive to examine the scores of 
children of full and mixed Asian parentage. 

84. A related topic that we do not review here is the comparison of blacks and 
whites on Level I and Level 11 abilities, using Jensens two-level theory of 
mental abilities (Jensen and Figueroa 1975; Jensen and Inouye 1980). The 
Undings are consistent with those presented under the discussion of WISC- 
R profiles and Spearman’s hypothesis. 

85. “Spearman’s hypothesis” is named after an observation made by Charles 
Spearman in 1927. Noting that the black-white difference varied system- 
atically for different kinds of tests, Spearman wrote thar the mean differ¬ 
ence “was most marked in just those [rests] which are known to he most 
saturated with g” (Spearman 1927, p. 379). Spearman himself never tried 
to develop his comment into a formal hypothesis or to test it. 

86. Jensen and Reynolds 1982. 

87- Jensen and Reynolds actually compared large sets of IQ scores with the 
full-scale IQ score held constant statistically. 

88. Jensen and Reynolds 1982, p. 427; Reynolds and Jensen 1983. 

89. Jensen and Reynolds 1982, pp. 428-429. 

90. Jensen 1985, 1987a. 

91. Jensen 1993b. 

92. Braden 1989. 

93. Jensen 1993b. 
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94. The correlations between g loading and black-white difference are typi- 
cally in the .5 to .8 ränge. 

95. A concrete example is provided by the Kaufman Assessment Battery for 
Chi Liren (K-ABC), a test that attained some visibility in part because 
the Separation between black and white children on it is smailer than on 
more Standard intelligence tests. It was later found that K-ABC is a less 
valid measure of g than the Standard tests (Jensen 1984a; Kaufman and 
Kaufman 1983; Naglieri and Bardos 1987). 

96. E.g., Pedersen et al. 1992. Jensen limits himself to discussing Spearman ’s 
hypothesis on the phenotypic level. 

97. Jensen 1977. 

98. Some other studies suggesr a systematic sibling difference for national 
populations, but it goes the other way: Eider siblings outscore younger 
sibiings in some data sets. However, this “birth-order” effect, when it oc- 
curs at all, is much smaller than the effect Jensen observed. 

99. Jensen 1985, 1987a. 

100. Various technical arguments were advanced against Jensen’s claim that 
biacks and whites differ the most on tests that are the most highly loaded 
ong. Many of these were effect ive ly resolved wich in the forum. One critic 
hypothesized that Jensen’s findings resulted from an artifact of varying 
reliabilities (Baron 1985). Jensen was able to demonstrate that correc- 
tions for unreliahility did not wash out the evidence for Spearman’s hy- 
pothesis and that some of the tests wirh low g loadings had high 
reliabilities to begin with, contrary to the critic’s assumption. Another 
commentator suggested that Jensen had inadvertently hui11 inro hisown 
analysis the very correlation between g loading and black-white differ- 
ence that he purported to discover (Schonemann 1985; see also Wilson 
1985). In the next round (the forum oceupied two issues of the journal), 
after being apprised of a response by physicist William Shockley (Shock- 
ley 1987), he withdrew his argument. A less serious criricism suggested 
that black-white differences did indeed correlare with some general fac- 
tor that rums up to varying degrees in different intelligence tests but that 
the factor may not beg (Borkowski and Maxwell 1985). To this criricism, 
Jensen was able to demonstrate that the g factor accounted for so large a 
fraction of the total variance in test scores that no other general factor 
could possibly he comparably correlated with hlack-white differences. A 
still less serious criricism (indeed, barely a criticism at all), made hy sev- 
eral commentators, was that the g that tums up in one battery of tests is 
likely to differ from the g that tums up in another (e.g., Kline 1985). 
Jensen accepted this point, noting, however, that the various gs are them- 
selves intercorrelated. 


A number of critics took a nontechnical tack. One set argued that 
Jensen’s analysis was conceptually circular. For example, if g is defined as 
intelligence, then tests that are loaded on g will he considered tests of in¬ 
telligence. If these happen, coincidentally, to be the tests that black and 
whites differ on, then Spearman’s hypothesis will seem to be confirmed, 
though the link between the tests and intelligence was simply postulated, 
not proved (Brody 1987). For a related argument see Macphail 1985. 
Jensen acknowledged that he had not tried to discuss the relationship of 
g to intelligence in this parricular article. Another set of critics made 
what could be called meta-critical comments, wondering why Jensen 
should want to uncover telationships that are not very interesting (Das 
1985), hurtful to biacks (Das 1985), inimical to world peace (Bardis 
1985), and likely to distract attention from the possibility of raising peo- 
ple’s g by educational means (Whimbey 1985). None of these commen- 
taries disputed that the data show what Jensen said they show. 

A tew years later, the last paper written by the noted psychometrician, 
Louis Guttman, before his death, attempted to demonstrate a mathe- 
marical circularity in Jensen’s argument, concluding that Spearman’s hy¬ 
pothesis is true by mathematical necessity (Guttman 1992). He argued 
that the factor analytic procedures that are used to extract an estimate 
of g cannot fail to produce a correlation between g and the B/W differ¬ 
ence. If the correlation is present by necessity, concluded Guttman, it 
can’t be telling us anything about nature. The gisr of Gutrman’s case is 
that if g is the only source of correlation across tests, then the varying 
B/W differences across tests must be correlated with g. Jensen and others 
were quick to point out that noone now believes that g is the only source 
of correlation between tests, just the largest one. We will not try to re- 
produce Guttman’s mathematical argument, not just because it would get 
us deep into algebra but because it was decisively refuted by other psy- 
chometricians who commented on it and seems to have found no other 
support since its publication. See Jensen 1992; Loehlin 1992; Roskam 
and Ellis 1992. 

101. Gustafsson 1992. 

102. Mercer 1984, pp. 297-310. 

103. Mercer 1988. 

104- Mercer 1988, p. 209. 

105. It would be useful for the reader if we could present Mercer’s resulrs so 
that they parallel the method we have been using, in which the socio- 
cultural variables and ethnicity are treated as independent variables pre- 
dicting IQ, but her presentation does not include that analysis. 

106. Mercer 1988, p. 208. 
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107. The critique of Mercer’s position has been highly technical. Readers who 
have the patience will find an extended exchange berween Mercer, 
Jensen, and Robert Gordon in Reynolds and Brown 1984. 

108. Mercer 1984, Tables 6, 9; Jensen 1984b, pp. 580-582. 

109. Boykin 1986, p. 61. 

110. For review, see Boykin 1986. 

111. Ogbu 1986. 

112. Flynn 1984, 1987a, 1987b. 

113. Merrill 1938. 

114. Flynn 1984, 1987b; Lynn and Hampson 1986c. 

115. Flynn 1987a, 1987b. 

116. Lynn and Hampson 1986a. 

117. Teasdale and Owen 1989. 

118. For evidence that this is what has happened in the United Stares, see 
Murray and Hermstein 1992. 

119. If the mean IQ in 1776 had been 30 and the Standard deviation was what 
ir is today, then America in the Revolutionär period had only five men 
and women with IQs above 100. 

120. Lynn and Hampson 1986a. 

121. Consider the analogy of height. The average stature of Americans has 
risen several inches since the Pilgrims landed at Plymouth, but height 
has run in families nevertheless. 

122. A shifting link hetween IQ and intelligente is not only possible but prob¬ 
able under certain conditions. For example, when the literacy level of a 
country rises rapidly, scores on conventional intelligence tests will also 
rise because more people will be better able to read the test. This rise is 
unlikely to be fully reflected in a rising intelligence level, at least with 
equal rapidity. Flynn 1987b discusses this general measuremenr issue. 

12 3. Scarr and Weinberg 1976, 1978, 1983; Weinberg, Scan and Waldman 
1992. 

124- Weinberg, Scarr, and Waldman 1992, Table 2. The progression of the IQ 
means from two black parents to one black/one white to two white par- 
ents is not as neatly supportive of a genetic hypothesis as mighr first ap- 
pear, because there is reason to suspecr that the mixed-race biological 
parents of the adopted children were disproportionately drawn froro Col¬ 
lege students, which in turn would imply that the [Q of the black parent 
was well above the black mean. 

125. Weinberg, Scan, and Waldman 1992. For the technical debate, see Levin 
in press; Lynn in press, with a response by Scarr and Weinberg in Wald¬ 
man, Weinberg, and Scarr in press. 

126. Weinberg, Scarr, and Waldman 1992, Table 2. The overall decline in 


scores for all groups was because a new test norm had been imposed in 
the interim, vitiating the Flynn effect for this group. 

127. Waldman, Weinberg, and Scarr in press. 

128. Eyferth 1961 For accounts in English, see Loehlin, Lindzey, and Spuhler 
1975; Flynn 1980. 

129. Loehlin, Lindzey, and Spuhler 1975, Chap. 5. 

130. An earlier study showed no significant association hetween the amounr 
of white ancesrry in a sample of American Blacks and their intelligence 
test scores (Scarr et al. 1977). tf the whires who contrihuted this ances¬ 
rry were a random sample of all whires, then this would be strong evi¬ 
dence of no genetic influence on Black-white differences. There is no 
evidence one way or another about rhe nature of the white ancestors. 

131. Lewontin, Rose, and Kamin 1984. 

132. Scarr and Weinberg 1976, Table 12. 

Ghapter 14 

1. U.S. Department of Labor 1993, Table 3. 

2. U.S. Rureau of the Census 1993, Table 1. 

3. The NLSY sample does not include GEDs. Nationally, the 1991 high 
school completion rate (signifying twelve years of school) was 87.0 per- 
cent for whites, 72.5 percent for blacks, and 55.4 percent for Latinos (Na¬ 
tional Center for Education Statistics 1993, p. 58). 

4. These results refer to a logistic analysis in which the dependent variable 
was a binary variable represenringobtaining a normal high school diploma. 
The independent variables were age and IQ. 

5. For persons ages 25 to 29 in 1992, the proportions with bachelor’s degrees 
were 26.7 percent for whites, 10.6 percent for blacks, and 11.4 percent for 
Latinos (National Center for Education Statistics 1993, p. 62). 

6. Welch 1973. 

7. For example, given the mean years of education for people entering rhe 
high-IQ occupations defined. in Chapter 3 (16.6) and hol ding age constant 
at the mean, the prohability that whites would Be in a Bigh-IQ occuparion 
was 14-4 percent compared to 12.8 percent for blacks and 18.1 percent for 
Latinos. 

8. Gottfredson 1986. 

9. Gottfredson 1986 leaves room for the possibility that blacks at the upper 
end of the IQ distribution were disproportionately choosing medicine, en- 
gineering, or rhe other professions she happened to examine. Perhaps if 
she had examined other high-lQ occupations (one may hypothesize), she 
would have found blacks represented at or below expectations. Our analy- 
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sis incorporating a broad ränge of high-IQ occupatio™, makes this hy- 

pot esis highly unlikely. The extension of the analysis in Chapter 20 rules 
it out altogether. 

10. The proportions in high-IQ occupations were 5.8 percent for whites, 3.1 
percent for blacks, and 3.7 percent for Latinos. 

11. After Controlling for IQ, the unrounded proportions in high-IQ occupa¬ 
tions were 10.4 percent for whites, 24.5 percent for blacks, and 16.2 per- 
Cent for Latinos, 

12. “Year round” is defined as people who reported being employed for fifty- 
two weeks in calendar 1989 and reported wage income greater than 0 (ex- 
cluding a small number who apparently were self-employed and did not 
pay themselves a wage). 

13. This result is based on a regression analysis when the wage is the depen- 
dent variable, age is the independent variable, and the analysis is run sep- 
arately for each race. The figures reported reflect the mean for a black and 
white of average age in the NLSY sample. 

14. For a more detailed technical analysis of the NLSY experience, reaching 
the same conclusions, see O’Neill 1990.0’Neill’s collateral findmgs about 
the joint role of education and IQ are taken up in Chapter 19. 

15. U.S. Bureau of the Census 1993, Table 29. 

16. Precisely, 64-4 percent higher, computed using unrounded poverty rates. 

17. For various approaches, see Bianchi and Farley 1980; Jargowsky 1993- 
Massey and Eggers 1990; Smith and Welch 1987, Eggebeen and Lichter 
1991. For a summary of the literature, see Jaynes and Williams 1989. 

18. U.S. Department of Labor 1993, Table 3. 

19. For civilian males not in school and not prevented from working by health 
Problems. 

20. Wilson 1987, Lemann 1991, Holzer 1986; Kasarda 1989; Topel 1993, 
Jaynes and Williams 1989. 

21. The proportions in 1960 were 66 percent (blacks) and 72 percent (whites). 
Computed from Tables 1 and 16, National Center for Health Statistics 
1993, and comparable tables in earlier editions. 

22. William Julius Wilson is best known for the lack-of-marriageable-males 
thesis (Wilson 1987), which is currently thought to have some explana- 
tory power (like IQ) but leaves the bulk of the discrepancy unexplained 
(as does IQ). See South 1993; Fossett and Kiecolt 1993; Bulcroft and Bul- 
croft 1993; Schoen and Kleugel 1988; Lichter, LeClere, and McLaughlin 
1991. For other empirical work bearing on the thesis, see Bennett, Bloom, 
and Craig 1989; Tucker and Taylor 1989; South and Lloyd 1992; Spanier 
and Glick 1980; Staples 1985. 

23. National Center for Health Statistics, 1993, Table 26. Figures in the text 
are for live births. 


24. E.g., Anderson 1989; Bumpass and McLanahan 1989; Duncan and Laren 
1990; Ellwood and Crane 1990; Furstenberg et al. 1987; Hogan and Kita" 
gawa 1985; Lundberg and Plotnick 1990; Murray 1993; Rowe and Rodgers 
1992; Teachman 1985, 

25. Computed from Committee on Ways and Means and U.S. House of Rep" 
resentatives 1993, pp. 688, 697; SAUS 1993, Table 23. 

26. These figures, already high, are even higher when the analysis is limited 
to mothers. The percentages of mothers who had ever been on welfare for 
blacks, Latinos, and whites, were 65,0,40.5 and 21.8, respectively. We com 
ducted parallel analyses limited to women who had borne a child prior to 
1986, giving at least five years’ “chance” for a woman to show up on the 
AFDC roles. This had the predictable effect of slightly increasing the per" 
centages of women who had ever received AFDC, but yielded the same 
substantive conclusions. 

27. Intergenerational transmission has some role. See McLanahan and 
Bumpass 1988; McLanahan 1988. For other discussions touching on racial 
differences in welfare recipiency, see An, Haveman, and Wolfe 1990; Bern" 
stam and Swan 1986; Bianchi and Farley 1980; Donnelly and Voydanoff 
1991; Duncan and Hoffman 1990; Hirschl and Rank 1991; Hofferth 1984; 
Hogan, Hao, and Paush 1990; Honig 1974; Hütchens, Jackson, and 
Schwartz 1987; Smith and Welch 1989; Wiseman 1984, Hoffman 1987; 
Rank 1988; Zabin et al, 1992. 

28. National Center for Health Statistics 1993, Table 26. 

29. Based on the Colorado Interuterine Growth Charts. 

30. For discussions of reasons for the black-white gap in low'birtbvweight ba" 
bies see David 1990; Kempe et al. 1992; Mangold and PowelLGriner 1991. 

31. U.S. Bureau of the Census 1993, Table 3. The Bureau of the Census does 
not break out “nomLatino whites” in the official statistics. If one assumes 
that all persons labeled as “Hispanic origin” were white, then 12.9 percent 
of non"Latino white children were under the poverty line, This is an um 
derestimate for the actual figure, since many persons of Hispanic origin are 
classified as black. The figure of 14 percent in the text is an estimate that 
attempts to compensate roughly for the underestimate. 

32. The reasons for the gap in black and white child poverty are discussed in 
the same literature that deals with differences in marriage rates and ille" 
gitimacy, which together account for much of the differing financial sitm 
ations facing black and white mothers of young children. 

33. Various approaches to ethnic differences in home environment are Heath 
1982; Bardouille"Crema, Black, and Martini986; Field et al. 1993; Kelley, 
Power, and Wimbush 1992; McLoyd 1990; Moore 1985; Pearson et al. 
1990; Radin 1971; Tolson and Wilson 1990; Wassermanet al. 1990. A use" 
ful older account is Davis and Havighurst 1946. 
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34. See Jones 1992 on abortion, Abramson and Claggett 1991 on voting, and 
Elliott and Ageton 1980 on delinquency. 

35. See the references (note 33) regarding et Knie differences in home envi- 
ronment. 

36. Refers to arrests for index criines in 1992 relative to the size of the black 
and white populations. Compured from Federal Bureau of Investigation 
1993, Table 43, and SAUS 1993, Table 22. See also Wilson and Herrnstein 
1985, Chap. 18. 

37. U.S. Bureau of the Census 1993b, Table 305. 

38. R. Gordon 1976, 1987. 

39. We cannot use theNLSY self-report data for inter-racial comparisons. Seih 
report crime measures have consistently revealed marked differences in the 
willingness of black and white youths to disclose crimes. See Elliott and 
Ageton 1980; Hindelang 1981; Hindelang, Hirschi, and Weis 1981. 

40. See the sixteen studies reviewed in Osbome and McGurk, 1982. See also 
rhe results from the Philadelphia delinquency eohort (Wolfgang, Figlio, 
and Sellin 1972). 

C hapter 15 

1. We would, of course, need to know something about the fathers’ scores too. 
The more complete account com es later in the c hapter. 

2. Alsosee Ghiselin and Scudo 1986; Ingie 1973. 

3. Soloway 1982. 

4. Francis Gal ton s coined the term eugenic. See Gal ton 1883. 

5. The eugenicists were active, but, as we noted in the Introduction, the in- 
telligence testers were not. For an account of what happened prior to the 
passage of the Xenophobie and nativ ist Immigration Restriction Act of 
1924 and how it has gotten distorred in the retelling, see Snyderman and 
Herrnstein 1983. 

6. ‘intrinsic birth rares” are birth rates corrected for age distributions. Death 
rates also decline during the demographic transition, but rhey will not he 
discussed in any detail he re. Demographers generally helieve that differ¬ 
ential death rares cease to he a major factor in population growrh in mod- 
ernized societies like ours. This is a supposition that needs to be reassessed, 
given the probable differential impactof infant mortalities, homicide rares, 
and AIDS in relation to tested intelligence. Of all the studies we summarize 
below, only Retherford and Sewell 1988 rakes mortality rates into account, 
but it did not have a national ly representative sample to analyze. We may 
surmise that the intergenerational decline in intelligence is being 
mitigated somewhat by differential intrinsic death rates. 

7- Retherford 1986; Retherford and Sewell 1988; Vining 1986; Wrong 1980. 


8. Retherford 1986; Retherford and Sewell 1988. 

9. Becker 1981. 

10. E.g., Retherford and Sewell 1988; Rindfuss, Bumpass, and John 1980. 

11. Vining 1982a, Vining 1986. 

12. Vining 1986. 

13. For a sampling of studies that indicate the importance of attitudinal vari¬ 
ables for motherhood in many nations, see Rooth and Duvall 1981; Hass 
1972; Krishnan 1990; Mason and Palan 1981; Youssef 1978. 

14- Esrimating rhe phenotypic, as disringuished from che genotypic, change in 
intelligence across ge ne rat io ns is conceptually little more than a mar rer of 
toting up the population yielded across the distribution of intelligence, 
then aggregating the subtorals to get the Overall distribution of scores in 
the next generation, after first taking account of regression to the mean 
(Andrews 1990; Falconer 1966; Retherford and Sewell 1988). It is not nec- 
essary to include any esrimate for rhe heritability of intelligence. This sim- 
plicity in conception should not be confused with simplicity in actually 
making rhese calculations. Rarents in, say, successive decilesof intelligence 
may have differing intrinsic rates of population growth (or decline) be- 
cause of varying lifetime fertilities, varying ages at reproduction, and vary- 
ing mortality rares. Assortative mating by the parents (see Chapter 4) 
matrers in calculation only insofar as it influences the correlation between 
parents and children. Hence, if fertility is lower at higher levels of intelli- 
gence, then assortative mating for intelligence will speed the decline of the 
population intelligence because it increases rhe correlation between par- 
ents and children. Some of the studies that we eite focus on the genotypic 
decline rather than the phenotypic (e.g., Retherford and Sewell 1988). 
Since children resemhle the parents who rear them for environmental rea- 
sons as well as genetic, the population phenotype will change more rapidly 
than the population genotype. 

15. The best review of the early srudies is Anastasi 1956. See also Duncan 
1952; Olneck, Wolfe, and Dean 1980; Retherford and Sewell 1988; Van- 
Court and Bean 1985; Vining 1986. 

16. Cattell 1936, Catteil 1937. 

17. Retherford and Sewell 1988. 

18. Cook, 1951 p. 6. 

19. As Osborn and Bajema (1972) stated, “The distribution of births in an in¬ 
dustrial welfare-state democracy would become more eugenic as che envi- 
ronment improved with respecr to health, educational, and occupational 
opportunities, and particularly with respect to the spread of hirth control 
to the point where freedom of parenthood became a reality for all ettizens” 
(p. 344). The Eugenic Hypothesis was first stated in Osborn 1940. 
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20. Maxwell 1954; Scottish Council for Research in Education 1949. 

21. Catteil 1951. See also Tuddenham 1948. 

22. Higgins, Reed, and Reed 1962. 

23. Bajeina 1963, 1971; Olneck, Wolfe, and Dean 1980; Waller 1971. In ad- 
dition, as we explained in Chapter 1 3, the Flynn Effect wouid have masked 
any decline in IQ by demographic processes. 

24. Cattell 1974; Osbome 1975. 

25. Retherford and Sewell 1988. 

26. Vining 1982b. 

27. VanCourt and Bean 1985. 

28. Retherford and Sewell 1988. 

29. Ree and Earles 1991a. 

30. The simplest way to get around the estimates that scholars have derived 
wouid he to measure the IQs of successive generations, following parents 
and their children, but surprisingiy few studies of any size measure cogni- 
tive ability in both parents and children, and those few have always heen 
small studies conducted for specific purposes; none has met the crueial cri- 
terion of national representativeness. ln the United States, the NLSY has 
the potential to yield such estimates, if the study continues long enough, 
be cause it has already initiated a program of testing the children of the 
NLSY mothers. Asofnow, however, it providesno interpretable data ahout 
the national population as a whole. The women of the NLSY are only pari - 
way thruugh their childbearing years (ages 25 to 3 3 as of our last observa- 
tion), and the children of the sample are atypical in thar they were 
disproportionately born to young mothers, who may differ in their child- 
rearing practices from older mothers. The sample is still missing altogerher 
many of the children of women who delay childbearing, who in tum are 
disproportionately women with advanced education—and high IQs. We 
can use the mother-child testing data to extract a few eines about ethmc 
differences, described later in this chapter. 

31. See Chapter 17. 

32. Not everyone agrees rhat it is worrisome. In a recent contribution to the 
fertility debate, Samuel Preston and Cameron Campbell (1991) challenge 
the premise that negative differential fertility on the microlevel must mean 
falling national irnelligence on the macrolevel. Such negative differentials 
are compatible, they argue, with a constant, improving, or deteriorating 
intelligence distribution in the populärion as a whole. Ir all dopends on 
how the current differentials rclate ro past and future fertility patterns. The 
argutnent is densely mathematical, and neither the article not the two ac- 
companying commentaries lend themselves to easy summay. Interpreting 
the argument is complicated by the fact that the authors operarionalized 
their model with one of the only data sets in which the fertility differen¬ 


tial is not negative. However, the narrowest mathematical implication of 
their model remains accurate: It is possible to postulate conditions rhar 
produce a constant or even rising IQ in the face of negative fertility dif¬ 
ferent ials. The re is no reason to suppose that those special conditions pre- 
vail now or have in the recent past. James Coleman (1993) similarly points 
out in his commentary that these hypothetical conditions do not have 
much to do with what is known about the history of fertility, concluding 
that “their rejection of the common belief ahout the effect of fertility dif¬ 
ferences is not warranted. What they have done is not to answer the ques- 
rions involved, hut to frame the problem in a most useful way” (p. 1032). 

33. A population has a limited number of ova and an unlimited numher of 
sperm. Therefore, what matters for replacement (net of migration) is how 
many females are born and what their fertilities are. Hence, since slightly 
more than 50 percent of births are males and since a few of the females do 
not reach the age of reproduction, the average woman needs to have ap- 
proximately 2.1 births to attain replacement fertility. 

34. Sweet and Rindfuss 1983, Fig. 2. Other countries similarly show the im- 
pact of education on fertility. A study of Mexican women in which ur- 
banization, occupation, migration, and education were examined for their 
effects on fertility found that education was the main depressanr. See Pick, 
Butler, and Pavgi 1988. 

35. Based on completed fertility for women ages 35 to 44 in the Bureau of the 
Census s Current Population Survey, a nationally repräsentative sample, 
in June 1992 (Bachu 1993, Tahle 2). The mean IQ represents the aggre- 
gated means by educational level. This calculation assumes that the mean 
IQ of women at various educational levels is the same for women born from 
1948 to 1957 (the national sample represented in the figure on page 349) 
as it was for the NLSY women born from 1957 to 1964. Is this plausible? 
Women born from 1948 to 1957 graduated from high school from 1966 to 
1975, afterthe percenrageofstudentsfinishingHighschoolhadhititspeak, 
after the major shifts in educational recruitment to College had already 
changed for whites, and after aggressive affirmative action had begun for 
hlacks and to some extent for Latinos. We can think of no reason to as- 
sume that the mean IQ of NLSY women (bom from 1957 to 1964) at dif¬ 
ferent leveis of educational attainment was systematically different than 
for the cohort of women bom from 1948 to 1957, though it could have 
heen. 

36. The data report the education of the mother at the time she has a child, 
hut a very young mother may later go back to finish high school, and a 
woman with a bachelor’s degree may return for a masters or a Ph.D. ln as- 
cribing IQs based on educational attainment, it is important to base rhem 
on the final attainment, not just on the years of education at the time of 
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birth. Our procedure for doing so was as follows: Using che NLSY, we first 
established the difference between education at che time of birth and ed- 
ucation as of 1990, when. the youngest woman in the NLSY was reaching 
26. ln the first Version of our procedure, it was assumed that the Propor¬ 
tion of women who gave birth at ages 26 to 33 (the age ränge of 98 per- 
cent of NLSY women by the 1990 interview) who would suhsequenrly 
move into a new educational category (the categories were 0-11, 12, 
13-15, 16, and 17 or more years of education) was extreme ly small. We 
then computed an adjusted version of the table showing birrbs by age by 
race in National Center for Health Statistics 1993, Table 20, assuming 
eventual educational attainment equal to that observed in the NLSY (for 
example, 36.1 percent of NLSY women who had ten years of education 
when they first gave birth reported twelve years of education by 1990; we 
recomputed the NCHS cell assuming that 36.1 percent of the women in 
the NCHS figures who were shown as having ten years of education would 
eventually get twelve). We then used the adjusted matrix of births by age 
by race to estimate IQs, using the NLSY mean IQs for women with equiv- 
alent years of education. Note that this computation must be done using 
separate estimates by race, because of the large discrepancy between the 
IQs of blacks and whites of equivalent years of education. This first iiera- 
tion yielded an estimated mean IQ of mothers for the 1991 U.S. birth co- 
hört of 97.9. We then repeated the process, using a sample limited to births 
that occurred by the end of 1986, meaning that each mother had at least 
four years of postbirth observation to see if she went back to school. This 
version avoided the assumption that women ages 26 and over seldom go 
back to school, at the cost of reducing sample sizes and perhaps intruduc- 
ing sorne unrepresentativeness into the truncated sample. The estimated 
IQ for the mothers of 1991 U.S. birth cohorr using this procedure was 98.0. 

37. The actual figure, based on all births through 1990, was 95.7. It is produced 
by taking the mean (using sample weights as always) of the IQ associared 
with the mother of each child born to an NLSY mother. 

38. Out of every 100 women ages 30 to 34 in 1990, only 2 had their first birth 
that year; afrer age 34, the proportion feil rapid ly to near zero. See ßachu 
1991, Table 4. We realize that many readers know personally of numerous 
women who had their first babies in their late thirties. It is one more use- 
ful example of the difference between the world in which most of our read- 
ers live and the rest of the country. 

39. Women of the NLSY who had reached ages 32 to 33 may he expected to 
have borne about 83 percent of all the babies they will evet hear (inter- 
polated from National Center for Health Statistics 1991, Table 2). 

40. The biases will understate the age differential by cognitive dass because 
(based on known patterns of childbearing by women of different educa¬ 


tional groups) the largest change in the final mean age of births will occur 
among the brightest women. 

41. Bachu 1993, Table 2. 

42. This find ing echoes points made in other places. We showed earlier (see 
Chapter 8) that it is not IQ per se that depresses fertility hut the things 
that a higher IQ results in, such as more education (see Retherford and 
Sewell 1989; Rindfuss, Morgan, and Spicegood 1980). At given IQ scores, 
blacks get more school ing than either whites or Latinos (Chapters 13, 18). 
Hence we should not be surprised that, at given IQ scores, blacks have 
lower fertility than either of the other groups; they are more likely to he 
still in school. 

43. Rindfuss, Morgan, and Spicegood 1980; Osbome 1973; Chen and Morgan 
1991b. 

44. Chen and Morgan 1991a; Rindfuss, Morgan, and Spicegood 1988. 

45. The quotation is taken from Baker and Mott 1989, p. 24- 

46. To mention just one of the most important reasons to hedge, the partici- 
pation of Latino mothers in the NLSY testing program was comparatively 
low, making the white-Latino comparison quite tentative. And as we cau- 
tioned in Chapter 14, the PPVT is probably less valid for Latinos than for 
other groups. This may bear on the comparison between Latino-white dif- 
ferences among mothers and among children. ln any case, the figure for 
the apparent dysgenic effect for the Latino-white comparison is small 
enough to deter strong conclusions. 

ln contrast, the black-white apparent dysgenic effect is large, and we 
examined it using several methods to see it it might be spurious. The tahle 
on page 356 reports the results using the children’s sample weights, and 
comparing tested children with the mothers of those children, counting a 
mother more than once if she had more than one child and counting the 
same child more than once if he or she had been tested in more than one 
year (after turning 6). If we repeat the same calculation but including all 
children who were tested (including those under the age of 6), the black- 
white difference among the mothers is 13.9 points, compared to a differ¬ 
ence among the children of 20.0 points, an even larger dysgenic difference 
than the one produced by the children ages 6 and older. Another approach 
is to discard the sample weights (which are problematic in several respects, 
when comparing across test years) and instead restrict the sample to chil¬ 
dren bom to mothers who were in the cross-sectional NLSY sample. Do¬ 
ing so for all children who took the PPVT after the age of 6 produces a 
B/W difference of 14.8 points for the mothers and 18.1 points for the chil- 
dren, or a dysgenic difference of 3.3 points. Doing so for all children who 
took the PPVT produces a B/W difference of 14.9 points for the mothers 
and 19.4 for the children, or a dysgenic difference of 4.5 points. 
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Our next Step was to examine separately the results from the three test 
years (1986, 1988, and 1990). For the children who were 6 or older when 
they took the test (which again shows a smaller difference than when the 
test includes all children), the B/W differences for the three test years, 
using sample weights, were 5.9, 1.9, and 3.0 points, respectively. The dif- 
ferences across test year did not affect the conclusion that a significant 
dysgenic effect exists, hut the reasons for the differences are worth inves- 
tigating. 

ln our attempt to see whether the dysgenic effect could be attenuated, 
we repeated all of these analyses with one difference; Instead of using the 
national norms for the PPVT (normed to a mean of 100 and SD of 15), we 
let the NLSY children be their own reference group, comparing the black 
and white scores using the ohserved mean and Standard deviarion for all 
NLSY children who took the test. This procedure reduees the estimate of 
the dysgenic effect. For example, rhe results, using sample weights, for the 
children who were 6 and older, showed an increasing B/W gap of 1.9 points 
instead of the 3.9 points produced hy using the national norms. The diffi- 
culty in inrerpreting this finding is that the procedure itself has no good 
rationale. The PPVT national norms seern to have been properly derer- 
mined. If anything, the Flynn effect should mean rhar the NLSY children, 
taking the test anywhere from seven to eleven years after the norms were 
established, should have a 2' to Vpoint IQ edge when compared to rhe na¬ 
tional norms. So we have no reason to think that rhe lower estimate is the 
correct one, hut it does represent the best way we could concoct to mini- 
mize rhe B/W dysgenic effect. 

Final ly, we explored how the hirths to NLSY women might affect these 
findings by comparing black and white women who had not hörne a child 
as of 1990. The mean IQ for the childless white women was 106.6, com¬ 
pared to 100.3 for childless black women. That black women without chil- 
dren have a mean of 100 is in itself striking evidence of the low fertility 
among rhe top part of the black IQ distrihution, hut even if suhsequent fer¬ 
tility for the two groups is the same, the B/W gap in the next generation 
will presumably conrinue to diverge as the NLSY women complete their 
fertility. 

47. New York Times. LSIighting words, fighting words.” heb. 13, 1990, p. A24. 

48. The computation in the text counts each mother as many times as she had 
children who were tested. If instead each mother is counted only once, the 
white-hlack difference among mothers is 1.12 SDs. The white-Latino dif- 
ference is 1.05 SDs. 

49. Auster 1990; Rouvier 1991; Gould 1981; Simon 1989; Wartenherg 1987; 
Wattenherg and Zinsmeisrer 1990. 

50. Holden 1988. 


51. E.g., Higham 1973; Lukacs 1986. 

52. Simon 1989. For a symposium, see Simon et al. 1993. 

53. Auster 1990, and various contributors in Simon et al. 1993. 

54. Bouvier and Davis 1982. This parricular estimate is based on annual Im¬ 
migration of 1 million. 

55. The figures for the 1950s, 1960s, and 1970s were 1 1 percent, 16 percent 
and 18 percent respectively. SAUS 1992, Tahle 14 (SAUS 1971, Table 4). 

56. Lynn 1991. 

57. SAUS 1992, Table 8. The figures also includes once-illegal immigrants who 
were granted permanent residence und er the Immigration Reform and 
Control Act of 1986. 

58. Sowell 1981. 

59. A first, elementary consideration is that the NLSY data refer almost ex- 
clusively to the children of the adults who decided to immigrate. What- 
ever self-selection for IQ might have existed in the elders will he less visible 
in their offspring. 

60. Carliner 1980; Chiswick 1978; Gabriel 1991. 

61. Borjas 1987. Borjas’s formulation also draws on Roy 1951 and Sjaastad 
1962. ln fortheoming papers, Borjas has since extended his analysis 
through the 1990 census, showing a continuation of rhe rrends from 1970 
to 1980. Borjas 1993, 1994- 

62. Borjas 1987, Tahle 3. 

63. Sowell 1981, p. 220. 

64. Borjas 1987, Table 3. 

65. Borjas 1987, p. 552. 

66. The procedure is limited to the NLSY’s cross-sect tonal sample (Le., omit- 
ting the supplemental samples), so that sample weights are no longer an 
issue. Using random numbers, subjeets with IQ scores above 97 had an 
equal chance of being discarded. Because different subsamples could yield 
different results, we created two separate samples with a mean of 97 and 
replicated all of the analyses. The data reported in the tahle on page 368 
represent the average produced by the two replications, compared to the 
national mean as reptesented by unweighted calculations using the entire 
cross-sectional sample. 

67. Cattell 1938, as reprinted in Cattell 1983. 

68. Cattell 1983, pp. 167, 168. 

69. Cattell 1983, pp. 167, 175. 

70. Cattell 1983, pp. 167, 169. 

71. The procedures parallel those used for the preceding analysis of a mean of 

97. 

72. In effect, our sample with a mean of 97 shows what happe ns when people 
with above-average IQs decrease their fertility, and our sample of 103 shows 
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what happens when people with be low-average lQs decrease theirs. WKen 
we changed the NLSY sample so that thc mean feil to 97, we used a ran¬ 
dom variable to delete people with IQs above 97 until the average reached 
97. This did not do much to get rid of people who had the pro hie ms; most 
of its effecr was to diminish the supply of people without prohlems. When 
we changed the NLSY sample so that the mean rose to 103, we were ran- 
dom ly deleting people with IQs below 103. In the course of that random 
deletion, a significant numher of people roward the bottom of the distrib- 
ution—our Classes IV and V—were deleted. Suppose instead we had low¬ 
ered the IQ to 97 by randomly duplicaüng suhjects with IQs below 97. In 
that case, we would have been simulating what happens when people wirb 
helow-average IQs increase their fertility, and the results would have been 
more closely symmetrical with the effects shown for the 103 sample. 

73. These figures continue to be based on the cross-seetional NLSY sample, 
used throughout this exercise. The 1989 poverty rate for the entire NLSY 
sample, calculated using sample weights, was 10.9 percent. 

Chapter /6 

1. A woman was classified as a chronic welfare recipient if she had received 
welfare for at least five years by the 1990 interview. Wo men with incom- 
plere data on AFDC in the years following the hirth of the first child or 
whose first child was born after 1985 were not scored on this variable. 

2. We do not weight the computations for the overrepresentanon of helow- 
average IQ mothers, hut we continue to use sample weights. 

L This represents the mean of the mothers of the NLSY children, with each 
mother counted once for each illegitimate child. Because of the inverse re- 
lationship hetween IQ and the numher of illegirimare children, the mean 
counting each mother of an illegitimate child only once was higher: 89. 

4. As in the case of illegitimacy, IQ and the numher of children of divorced 
and separat ed mothers were inverse ly related. When the mother is counted 
only once regardless of the numher of children, the mean is 94. 

5. See Chapter 10 for a description of this intelligente test (the PP VT). 

Cl\apter 1 7 

1. A briet refresher (see Chapter 4): A heritability of 60 percent (a mid-range 
esrimnte) says that 40 percent of the observed Variation in intelligente 
would disappear if a magic wand wiped out the differentes in those aspecrs 
of the environment that bear on intelligente, üiven that Variante is the 
Standard deviarion squared and that the Standard deviation of IQ is 15, this 
means that 40 percent of 15 J is due ro environmental Variation, which is 
to say that the Variante would drop from 225 to 135 and the Standard de¬ 


viation would contracr tu 11.6 instead of 15 Lf all the environmental 
sources of Variation disappeared. 

2. l A healthy mind in a healrhy body.” Some of the history is recounted in 
Lynn 1990h. Abstraets of a series of studies by Stephen Schoenthaler and 
his associates on the effects of diet on intelligente and on antisocial, crim- 
inal behavior are in Schoenthaler 1991. 

3. Steinetal. 1972. 

4- Lynn 1990h. 

5. Benton and Roberts 1988. 

6. At the age of 12 and 1.3, youngsters 1 scores rise during an eight-month pe- 
riocl in the natural course of events. The dietary Supplement, then, is af- 
fecting the rare of increase of the nonverbal, hut not the verbal, scores. 

7. Schoenthaler et ab 1991. 

8. WISC-R. Block Design, a highly g-Ioaded subtest of WISC-R, showed lit- 
tle or no henefit of the food Supplement. 

9. Earlier work suggesting that reductions in refined sugar increase intelli¬ 
gente are now being reinterpreted as the effecr not of sugar per se hut of 
shifring rhe diet away from foods with little in the way of vitamins and 
minerals to more nutritious foods; see Schoenthaler et al. 1991; Schoen¬ 
thaler Doraz, and Wakefield 1986. The basic point is that we have almost 
no idea of the pathway hetween diet or food Supplements and intellectual 
development; assuming the re is a path, ir could be long and winding. 

10. A child taking a pill that gives, say, one RDA is getting more rhan rhe rec- 
ommended daily allowanees, since the rest of his diet cannot he utterly de- 
void ot vitamins and minerals. 

11. For a failure to confirm an effect ot vitamin-mineral Supplements, see 
Crombie er al. 1990, and for a failure to find an effect on intelligence of 
diet short of chronic malnutrition, see Church and Katigbak 1991. For 
more general discussion of the issue, see Eysenck 1991; Lynn 1990; Yudkin 
1991. 

12. Later children are on the average bom into larger families, which tend to 
be of lower average IQ. Hence, there is a decline with successive births 
that is a by-product of family size in and of itself. However, even after the 
family size effect is extracted, there may be a decline with birth Order. The 
classic demonstration of declining scores with successive births indepen¬ 
dent of family size is a study based on a large sample of Dutch men (Bei- 
mont and Marolla 1973; Beimont, Stein, and Zyhett 1978), Since then, 
subsequent studies have both confirmed and failed to confirm the basic re- 
lationship (e.g., Blake 1989; Retherford and Sewell 1991; Zajonc 1976). 
At present, there is no resolution of the varying findings. 

13. Representative findings, on Japanese twins, are in Takuma 1966, described 
in Iwawaki and Vernon 1988. 
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14. For a review of the literature on twin differences in hirth weight in rela- 
tion ro IQ as well as of other evidence that the uterine environment af- 
fects intelligence, see Storfer 1990. 

15. Achenbach et al. 1990. This study compared two dozen low-birtlvweighr 
babies whose mothers received training in mothering with comparahly 
small groups of normal-weight babies and low-birth-weight babies whose 
mothers did not receive the training. The encouraging outcome is that 
when the children were 7 years old, the usual deficit seems to have been 
forestalled by having trained the mothers in infam nurturing. However, 
the small scale of the study, the lack of random assignment to the three 
groups, and the puzzling near identity in scores for the underweight chib 
dren whose mothers had been trained and the normal children suggest that 
the next Step should to attempt to replicate the finding, as the authors 
themselves say. 

16. For a helpful and balanced introduction to aptitude-treatment interne- 
tions, see Snow 1982. 

17. Hativa 1988. 

18. Atkinson 1974. 

19. Cook er al. 1975. 

20. Coleman et al. 1966. The report talked about educational “aptitude,” hut 
the measures used—vocabulary scores, reading comprehension, nvarhe- 
matical reasoning tasks, etc.—were taken Irom Standard group tests of IQ. 

21. See Mosteller and Moynihan 1972 for a collection of more or less critical 
articles; included also is Coleman’s response to the most intense method- 
ological criticisms (Coleman 1972). The combarants were often trying to 
answer different questions, with Coleman mostly interested in whether the 
objective differences among schools were responsihle for the observed dif¬ 
ferences in abilities and his critics more interested in characterizing the 
objective differences in the schools. We cannot do justice to the ränge of 
issues that surfaced in the report and the subsequent commentary, but onc 
of them deserves mention: The report uncovered evidence that the ethnic 
and socioeconomic mix of students in a school had a larger impact than 
the more Standard investments in per pupil expenditures, teacher salaries, 
quality of physical plant, and the like. This, in turn, became a tnajor argu- 
ment for school busing. Soon after, school busing itself became a barrle- 
ground for social researchers, a tale we will not teil here except to say that 
having a beneficial effect on intelligence is no longer used as an argument 
in favor of busing. 

22. Coleman and Hoffet 1987. 

23. It isn’t hard tofind what seems to be theopposite conclusion in educational 
writings (e.g., the Coleman report is “no longer taken seriously," Zigler and 
Muenchow 1992, p. 62) but noone has been able toshow that the variables 


examined in tbe report accnunt for much of the Variation in cognitive abi l- 
iry among American public school students. If they are in any sense not 
taken seriously, it is presutnably because educational variables other than 
the ones that Coleman studied have been found to be significant. This 
chapter reviews the evidence about those other variables as well. 

24. See Kozol 1992 for a passionate argument rhat disparities in school (und¬ 
ing are a major cause of disparities in educational outcomes. 

25. Husen and Tuijnman 1991. 

26. The quantitative details of the study are not germane to Contemporary 
times, but even then, when schooling varied so broadiy, the direct Unk be- 
tween IQ at: tbe age of 10 and at 20 was a minimum of five times stronger 
than rhat between amount of schooling and IQ at 20, in terms of variance 
accounted for in a path analysis. 

27. Flynn himself does not believe rhat educational equalization per se ac- 
counrs for much of the rise in IQ in some countries such as Holland (Flynn 
1987a), but then Flynn also does not believe that the rising national av- 
erages in IQ really reflect rising intelligence. 

28. Stephen Ceci (1991) has summarized evidence, much of it from earlier in 
tbe Century, for an impact of schooling on intelligence. 

29. National Center for Education Statist ics 1981,Tahle 161,1992,Table 347. 

30. McLaughlin 1977, p. 55. 

11. McLaughlin 1977, p. 53 The failure of such compensatory efforts antedated 
rhe Great Society by many years, however. An early educational researcher 
writing of similar compensatory efforts in 1938 concluded that “whatever 
the numher of years over which growth was studied; whatever the number 
of cases in the several groups used for comparisons; whatever the grade 
groups in which the IQs were obtained; whatever the length of the inter- 
val between initial and final testing; in short, whatever the comparison, 
no significant change in IQs has been found” (Lamson 1938, p. 70). 

32. Office of Policy and Planning 1993. 

33. For more on this distinction, see Adams 1989; Brown and Campione 1982; 
Jensen 1993a; Nickerson, Perkins, and Smith 1985. 

34. “Chicago educator pushes common sense,” St. Louis Post Dispate/i, Dec. 2, 
1990, p. 5D; “Marva Collins still expects, gets much,” St. Petersburg Times, 
July 23, 1989, p. 6A; “Pioneering educator does not want post in a Clin¬ 
ton cabinet,” Minneapoiis Star Tribüne, Oct. 25, 1992, p. 22A. 

35. Spitz 1986. See also “Chicago schools get an education in muckraking,” 
Chicago Tribüne , May 8, 1989, p. IC. 

36. “Fairfax principal, 4 other educators diseiplined in rest-coaching,” Wash¬ 
ington Post , Aug. 7, 1987, p. CI. 

37. “Pressure for high scores blamed in fest cheating,” Los Angeles Times, Sept. 
18, 1988, p. 1. 
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38. “S.L principal said to fudge school scores,” New York Times, Juiy 19, 1991, 
p.Bl. 

39. For a sense of the magnitude of the cheating problem, see “Schools for 
Scandal ” U.S. News & World Report, April 27, 1992, p. 66. 

40. The minister was Luis Alberto Machado, a high official in the mling party 
at the time. 

4L Based on estimates in the preceding years, the children in the two groups 
were chosen to be of comparabie cognitive ability. For descriprions of the 
experiment, see Herrnstein et al. 1986; Nickerson 1986. 

42. The teachers’ manual for most of the lessons, translated into English, is 
available as Adams 1986. 

43. See Brigham 1932 for the relevant background. Rriefly, the SAT was orig- 
inally designed to be an intelligence test targeted for the college-going pop- 
ulation and was originally validated against existing intelligence tests. For 
a modern source showing how carefully the College Board avoids saying 
the SAT measures intelligence while presenting the evidence that it does, 
see Donlon 1984. 

44. Fallows 1980; Slack and Porter 1980; Messick 1980; DerSimonian and 
Laird 1983; Dyer 1987; Becker 1990. 

45. Messick and Jungeblut 1981. 

46. From 1980 to 1992, the SAT-V Standard deviation varied from 109 to 112 
and the SAT-M Standard deviation varied from 117 to 12 3. For the calcu- 
lations, we assumed SDs of 110 and 120, respectively. 

47. McCall 1979. 

48. McCall 1987. 

49. Alexander Pope (in his Mora! Essays) is the poet, and the entire coupler 
is “Tis education forms the common mind; / Just as the ewig is bent rhe 
rree’s inclined.” 

50. See Mastropieri 1987 for a review of the expert consensus on this point. 

51. For a sympathetic rendition of the program and its history, see Zigler and 
Muenchow 1992. For a more critical account, see Spitz 1986. We try to 
keep our account as close to what these two have in common as we can. 

52. “Project Rush-Rush” was what Head Start was called by rhose in Wash¬ 
ington who thought that it was plunging ahead with more speed than de- 
liberation (quoted in Caruso, Taylor, and Detterman 1982, p. 52). 

53. Zigier and Muenchow 1992, reporting the conclusions of Leon Eisenberg 
and C. Keith Connors after the first summer program. Only slightly less 
grandiose were the Claims of raising IQ scores “a point a month” that were 
often cited by enthusiasts. 

54. Sargent Shriver, brother-in-law of the late President, John Kennedy, and 
former head of the Peace Corps. 

55. The first comprehensive evaluarion was the so-called Westinghouse study, 


which the Office of Economic Opportun! ty sponsored. Its conclusion 
was that there were few or no cognitive henefits of Head Start with in 
three years after the child completed it (Oicarelli, Evans, and Schiller 
1969). Soon there was a minLindustry picking over the Westinghouse 
study, in addition to rhe one picking over Head Starr. The consensus 
is now clear: Cognitive gains vanish before the end of primary school, 
e.g., Haskins 1989; McKey 1985; Spitz 1986; Zigler and Muenchow 1992. 
The new consensus has recently surtnced in the populär media (e.g., 
J. De Parle, “Sharp criricism for Head Start, even by friends,” New York 
Times, Mar. 19, 1995, p. Al). 

56. For a ränge of views, see Gamble and Zigler 1989; McKey 1985; Zigler and 
Muenchow 1992. 

57. E.g. Haskins 1989. 

58. Zigler and Muenchow 1992. Edward Zigler, one of the early research di- 
rectors of Head Start and a professor at Yale, argues in his book that it was 
a mistake from the beginning to promise gains in intelligence to the pub¬ 
lic. The more general shift away from making increases in IQ the target of 
preschool programs is discussed in Garber and Hodge 1991; Locurto 1991; 
Schweinhart and Weikarr 1991, pro and con. 

59. Among the people promising gains in the 500 percent ränge is the Presi¬ 
dent of the United States, as reported hy Jason DeParle (“Sharp criticism 
for Head Srart, even hy friends,” New York Times, Mar. 19, 1995). Even 
more of an Optimist is economist Alan Blinder, who once promised a re- 
tum of $4-75 for every clollarspent on preschool education (Blinder 1987). 

60. For a review of such henefits from Head Starr programs, see Haskins 1989, 
who concludes that the results “call for humility” (p. 280). The Head Starr 
literature, he says, “will not support the claim that a program of national 
scope would yield lasting impacts on childrens school performance nor 
suhstantial returns on the investment of public dollars” (p. 280). In short, 
there are no sleeper effects from Head Start. Even the evidence of cost-ef- 
fective returns in the more intensive educational programs is highly re- 
stricted. For a literature review, see Barnett and Escobar 1987. 

61. Most of the children were 3 years old and spent two years in the program; 
the 22 percent who were 4 spent only one year in it (Barnett 1985; 
Berrueta-Clemenr et al. 1984. 

62. Half a school day, or about rwo and a half hours. 

63. The lack of effect was indirectly confirmed in a subsequent study hy the 
samt' group of workers. They failed to find any differential effect on IQ of 
three different forms of preschool: their own cognitive enrichment pro¬ 
gram, a language-enhancing program, and a conventional nursery scho< 
program (Weikarr et al. 1978). There was no conrrol group in this follo 1 
up, so we cannot say how much, if ar all, preschool per se influenced 
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64- For a critical reading üf just how minimal these other effects of preschool 
may have Feen, sec Spitz 1986, 

65. Lazar and Darlington 1982. 

66. Similar estimates can he found in a study ot che early effects of Head Start 
and the consortium sample (Lee er al. 1990). 

67. Lazar and Darlington 1982, p. 47 The people who do these studies ofren 
argue that other positive effects are not heing picked up in the formal mea- 
sureinents (e.g., Ramey, MacPhee, and Yeates 1982). 

68. Many publications have flowed trom the project; usetul summaries are in 
Ramey 1992; Ramey, MacPhee, and Yeates 1982. 

69. Personal eommunieation trom Ron Haskins. 

70. Ramey 1992. 

71. These differences are clearer in the critical aceounts of the project in Spitz 
1986 and 1992 than in the repurt by Ramey, MacPhee, and Yeates 1982. 

72. Herrnstein 1982; Sommer and Sommer 1983. 

7 3. Page 1972; Page and ürandon 1981. 

74- Garher 1988; Garberand Hodge 1991. 

75. Jensen 1989; Locurto 1991. The problem of “reaching to the test” veeurs 
in educational inrerventions. 1t is hased on the tesr’s being less than a per¬ 
fect measure of intelligence (org), so rhar it is possihle to change the score 
without changing the underlying trait (see further discussion in Jensen 
1993a). 

76. Our topic here is the effect of adoption on raising IQ, not the implicatinrvs 
of adoption data for estimating the heritahility of IQ. For revievvs of die 
adoption literature, see Herrnstein 1973; Locurto 1990; Munsinger 1975; 
Plomin and DeFries 1985. A comprehensive theoretical analysis ot adop¬ 
tion studies of intelligence is in Turkheimer 1991. 

77. Brown 1958, Chap. 5; Lane 1976; Lane and Pillard 1978. 

78. Among others inspired hy rhis evidence from “wild children” ot the power 
over the mind of the human environment was an Italian physician trained 
at the end of the nineteenth Century whose approacli to education has sur- 
vived the twentierh, Maria Montessori. 

79. Locurro 1990; Plomin and DeFries 1985. In a refinement of rhis observa- 
tion, it has been found that adopted children also score lower than the chil¬ 
dren in other homes that are socioeconomicaily the same as those ottheir 
adoptive parents hur have no -adopted children (rhereby Controlling for 
posstble ways in which adoptive parents might be distinctive from non- 
adoptive parenrs). 

80. Locurto 1990. 

81. Dumaret and Stewart 1985; Schiff et al. 1982; Schiff and Lewontin 1986. 

82. We will disregard in our analysis a number of considerarions that would re- 


duce estimates of the impact of honte environment, such as that the IQ of 
the schoolmates of the nonadopted half-siblings (who presumahly share 
comparable lower-class surroundings) averaged only seven points less than 
the adopted children, not twelve. This difference raises the possihility rhat 
the adopted-away child seemed hrighter in infancy or had better intellec- 
tual prospects than the half-sihling who srayed at home because of the par- 
ent rhey did not share, or that the shift in home environments was even 
more extreme than the estimates below assume it was, as if the adopted 
child’s biological family home was atypically poor, even for the poor neigh- 
borhoods they were in. This, as we explain below, would reduce the over- 
all estitnate of the impact of home environment. 

8L The cell sizes in the 2x2 tahle of high- and low-SES adopring and hio- 
logical parenr families were only ten children or fewer. 

84- Gapron and Duyme 1989. This study showed an even latger benefit— 
equivalenr to sixreen IQ points—of having high-SES biological parents, 
even when the child was not reared by them, which again points to a her- 
itabiliry greater than .5. 

85. This, it should he remembered, is for childhood IQ, which is more suh- 
ject ro the influence of home environment than adult IQ. Recenr work 
has also indicated that how a parent treats a child (presumahly also an 
adopted child) is in part determined by the child’s inherited eharac- 
teristics. To that extenr, speaking of home environmeni as il it were purely 
an environmental source of Variation is incorrect (see Plomin and Ber- 
geman 1991). 

86. A twenty-poinr swing is easily reconciled with a heritahility of .6 for IQ. 
Suppose the high- and low-SES homes in the French studies represent the 
90rh and lOth centiie of environmental quality, as the text says. A twenty- 
poinr swing in IQ from the 2d to the 98th centiie of environmental qual¬ 
ity would then imply that the Standard deviation of home environment 
effects on IQ is 4.69. Squared, this means a variance of 22 attrihutable to 
home environment. But as we noted in note 1, a heritahility of .6 implies 
that there is a variance of 225 - 135, or 90, attrihutable to environmental 
sources. The French adoption studies, in short, are consistent with the c.on- 
clusion that about a quarter of environmental variance is rhe variance 
across homes (if our guesses ahout the adopting and biological home en¬ 
vironments are not way off). Three-quarters of the environmental influ¬ 
ence on intelligence must he uncorrelated with rhe family SES, according 
to the present analysis. Note again that the Balance rips toward environ¬ 
mental faetors outside families as heing the more relevant than those pro- 
vided by families in affecting IQ, as mentioned in Chapter 4. 

87. For a discussion of cost-benefit considerarions, see Haskins, 1989. 
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C/uipter 18 

1. “Sharpen your pencil, und begin now,” Wall Street Journal, June 9, I 992, p. 
A16. 

2. National Commission on Excellence in Educarion 1983, p. 5. 

3. National Commission on Excellence in Education 1984, p- 38. 

4. For an example of an alarmist view and a discussion of the various esti- 
mates, see Kozol 1985. 

5. National Center for Education Statistics 1992, Table 12-4- 

6. DES 1992, Table 95. 

7- Ravitch and Finn 1987, p. 49. 

8. Congressional Budget Office 1987, p. 16. 

9. Congressional Budget Office 1987, p. 16. 

10. Quoted in Kozol 1985, p. 9. 

11. Four of the studies were conducted by the International Association for 
the Evaluation of Educational Achievement, knovvn as the 1EA. They were 
the First International Mathemarics Study (F1MS), mid- 1960s; rhe First 
International Science Study (FISS), 1966-197 3; the SeeonJ International 
Marhematics Study (SIMS), 1981-1982; and rhe Second International 
Science Study (SISS), 1981-1982. The fifth study was mitiated hy ibe 
United States as a spin-off from NA EP. h was conducted in 1988 and is 
known as the First International Assessment of Educational Progress 
(IAEP-l) (Medrich and Griffith 1992). 

12. Medrich and Griffirh 1992, Appendix B. 

13. National Center for Education Statist ics 1992, pp. 208-21 5. 

14- The best single source for understanding complexiries of international 
eomparisons is the summary and synrhesis produced by National Center 
for Educational Star ist ics (Medrich and Griff ith 1992). Other basic somxes 
in this literarure are Walker 1976; McKnight et al. 1989 ; Keeves 1991 , 
There are culrural facrors too. In his vigorous defense of Americ an educa- 
tion, Gerald Bracey teils of the scene in a Korean classroom during one 
such international rest: “Aseach Korean students name was called tocome 
to the testing area, that child stood and exited the classroom to loud ap- 
plause. What a personal honor to he chosen to perform for the honoi of 
the nation!” American children seldom reuet that way, Bracey ohserves 
(Bracey 1991, p. 113). 

15. Bishop 1993b, National C.'enter for Education Staristics 1992a, pp. 60 -61. 

16. In addirion to Bishop 1989, reviewed helow, see especially Cnrlson, Huch 
skump, and Woodall 1993; Bracey 1991. 

17. Bishop 1989. 

18. The Flynn effect refers to gradually rising scores over time on cognitive 
ahility tests, discussed in Chapter 1 3. 


19. NA EP periodically tests re presen tat ive samples of students at different age 
levels in marhematics, reading, science, and, more recently, in writing and 
in hisrory and lilerature. 

20. National ('enter for Education Staristics, 1991, Fig. 1. The tests were de- 
signed to have a mcan of 250 and a Standard deviarion of 50 when raken 
across all rhree age groups. The exception to flat trend lines was science 
performance among 1 7-year-olds, which shows a fifteen-point decline from 
1969 to 1990, somewhat more than .3 SU (we do not know the specific 
Standard deviutions for 1 7-year-olds on rhe science fest; probably it is less 
than 50). Note also that science among 17-year-olds reflects dispropnr- 
tionately the performance of the above-average students who tend to take 
high school science—-consisrenr wirb out broader theme that educational 
performance dereriorated primarily among the gifted. 

21. Two large quesrions uhout the table on page 422 immediately present 
rhemseh'es. First, are the five studies accurate representations of the na¬ 
tional samples that they purported to select, and are the five tests compa- 
rahle wich euch other? The ans wer to the first half of the quesrion is a 
qualified yes. The studies were not perfect, but all appear to have been well 
designed and executed. The qualification is that the data exclude youlüg¬ 
st ers who did not reach rhe junior year in high schon!. The ans wer to the 
second half of the quesrion is cloudier, if only hecause sets of tests admitv 
isiered ar different times to different samples always introduce incompara- 
hilities wirh effects that cannot he assessed precisely. The prüdent 
conclusion regarding the math scores is to discount the modest fall and rise 
from 1955 to 1983 and assume instead that math aptitude over that period 
was sready. Regarding the Verbal scores, ir seems likely that they rose from 
1955 to 1966 and dropped from sometime after 1966 to sometime helween 
1974 and 1983, with the magnitude and precise timing of tbose sh ifrs still 
open to quesrion. Before leaving the norm srudies, we must add a proviso: 
rhe SAT scales got easier during 1963 to 1973 by about eight to thirteen 
points on rhe Verbal and perhaps ten to seventeen points on rhe Math. 
They seem to have been stable before and following this period (Modu and 
Stern 1975, 1977). The same person would, in other words, have eamed a 
higher score on rhe later SATs than the earlier ones, owing pure ly to 
changes in the test scales themselves. Whether the PSAT, a much shorter 
test, experienced the same degree of drifr is unknown, hut it is a good idea 
to adjust mentally the 1974 and 1983 scores downvvard a hit, though this 
does not change the overall interpretation of the results. 

22. Grades 10 and 11 show a similar pattem. Grade 12 remained slightly un- 
der irs high (1965-1967) as of 1992, hut it is likely that rhe deficit is ex- 
plained by increases in the proportion of 17-year-olds retained in schoob 
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The possihility remains open, however, that education in the post-slump 
period improved rnore in the lower gradcs than in the higher ones. 

23. Congressional Budget Office 1986. 

24- Med rieh and Griffith 1992. 

25. The College Board added new method of reporting test scores in 1967 
based on seniors instead of all tests administered, and continued to reporr 
the means for both types of samples through 1977. Düring the years when 
both scores were available, the trends were visually almost indisringuish- 
able. In the year when we employed rhe new measure in the graph on page 
425, 1970, the scores for the two methods were idenrical. 

26. Based on the 1963 Standard deviations, .49 and .32 SD reducrions respec- 
tively. 

27. For a technical Statement of rhis argument, see Carlson, Huelskamp, and 
Woodall 1993. 

28. Readers can follow the journey through the numbers in Murray and 
Herrnstein 1992. 

29. lt is possible that the SAT pool was not getting democrarized in rhe usual 
socioeconomic sense but was nevertheless beginning to dig deeper into rhe 
cognitive distribution. Responses in the SAT Student questionnaire indi- 
cate that somewhat more students trom the bottom of the dass were rak- 
ing the test in 1992 than in 1976, but this eftect was extreme ly small for 
whites. ln 1980, 72.2 percent of whites reporred that they were in the top 
two-fifths of theirhigh school dass, compared to 71.5 percent in 1992. We 
nonetheless explored rhe possibility that the pool had hecome cognitively 
democrarized, hy looking at the scores of students who reported that they 
were in the top tenth, the second tenrh, and the second fifrh of their dasses. 
If their scores went up while those for the entire SAT sample went down, 
that would be suggestive evidence (if we make certain assumptions abour 
the consistency with which students reported their rrue dass rank) that 
the pool was drawing from a cognitive ly broader segnient of the popula- 
tion. Using 1980 (the end of the decline) to 1992 as the period of com- 
parison, the Verbal scores of whites who reported they were in the top 
tenth, 2d tenth, and 2d fifth went up by five, seven, and eight points re- 
spectively, while that of the entire white SAT pool remained flat, ln Math, 
the scores of rhe top tenth, 2d tenrh, and 2d fifth went up hy nine, thir- 
teen, and fourreen points, respectively, while that of the pool rose hy nine 
points. At first glance, rhis would seem to he evidence for a srrong effect 
of cognitive democratization. Bur then we looked at what happened to the 
scores of white students reporting that they were in the 3d, 4rh, and lovv- 
est fifrhs of their dasses. Their scores went up by much more: nine, eleven, 
and ton points, respectively, in the Verbal; seven teen, seven teen, and nine 
in rhe Math. We are avvare of Simpsons paradox, which shows hovv scores 


in each interval can go up when scores in the aggregated group go down, 
hut in this case the explanation appears to lie in changes either in the way 
that students report their dass rank, the meaning of dass rank, or borh. 
We give “cognitive democratization” credit for two points each in the Ver¬ 
bal and Math, hut it is not ccm - ta in that even that much is warum red. 

30. For an argument that the test score decline does in fact represenr falling 
inteltigence, see Itzkoff 1993. 

31. For a broader discussion of falling SAT scores in the high-scoring segment 
of rhe pool, see Singal 1991. 

32. From 1967, scores were reported for all test takers; from 1972 through 1976, 
ETS reporred scores for all test takers and for college-hound seniors. To es- 
rimate college-hound seniors for 1967-1972, we com put cd the ratio of col- 
lege-bound seniors ro total test takers for the overlapping years of 
1972-1976. For Verbat, the mean ratio was .82, with a high of .88 and a 
low of .77. For Math, rhe mean ratio was .78, with a high of .85 and a low 
oi .71. The mean ratios were applied to the data trom 1967 to 1972 ro olv 
tain an estimare of the number of college-hound seniors. 

33. ETS keeps careful watch on changes in item difficulty, which are ealled 
“scale drift.“ lt finds that scores ot 650 and ahove were lirrle aff ec red hy 
seale drift (Modu and Stern 1975; 1977). 

34- The remaining possibility is that the increase in the SAT pool during the 
1980s hroughf students into the pool who could score 700 but had not heen 
taking the test before. This possibility is not subject toexamination. lt must 
he ser against the evidence that extreme ly high propottions ot the top stu¬ 
dents have heen going to College since the early 1960s and that the best- 
of-rhe-best, represented by those who score more than 700 on the SAT, 
have heen avidly seeking, and being sought by, elite Colleges since the 
1950s, which means that they have heen taking rhe SAT. Note also that 
rhe proportion of SAT students who identify themselves as being in the 
top tenth of their high school dass—where 700 scorers are almost certain 
to be—was virtually unchangecl from 1981 to 1992. Finally, if highly tal- 
ented new students were being clrawn from some mysrerious source, why 
did we see no improvement on the SAT-Verbal ? Ir seems unlikely that the 
increase in rhe overalt proportion of high school students taking the SAT 
can account for more than a small proportion, if any, of the remarkahle im- 
provement in Math scores among the most gifred during the 1980s. 

35. Once again, rhe changes are not caused by changes in the ethnic cornpo- 
sition of the pool (for example, hy an influx of test takers who clo not speak 
English as their native language). The trendline for whites since 1980 par- 
allels that for the entire test population. 

36. National Center for Education Statistics 1992, p. 57. We also examined 
rhe SAT achievemenr test results. They are harder ro inrerpret than the 




752 Notes to page 430 


Notes to pages 430-437 753 


SATs because the test is regularly rescaled as the popularion ofstudents 
taking the test changes. For a description of the equating and rescaling pn> 
cedures used for the achievement tests, see Donlon 1984, pp. 2]-27. The 
effects of these rescalings, which are too complex to descrihe here, are suh- 
stantial. For example the average Student vvho took the Biology achieve- 
ment test in 1976 had an SAT-Math score that was 71 points ahove the 
national mean; by 1992, that gap had increased to 126 points. The same 
phenomenonhas occurred with mostof'theother achievement tests (Math 
11, the more advancedof the two mach achievement tests, is an exception). 
Put roughly, the students who take them are increasingly unrepresentative 
of the college-bound seniors who take the SAT, 1er alone of the national 
population. We focused on the students scoring 700 or higher by again as- 
suming that since the 1960s, a very high proportion of the narions stu¬ 
dents who could score higher than 700 on any given achievement test took 
the test. We examined trends on the English Composition, American His- 
tory, Biology, and Math II tests front three perspectives: the students scor¬ 
ing ahove 700 as a proportion of( 1) all students who took that achievement 
test; (2) all students who took the SAT; and (3) all 17-year-olds. Merhod 
1 (as a proportion ofstudents taking the achievement test) revealed flat 
rrendlines—not surprisingly, given the nature of the rescaling. Methods 2 
and 3 revealed similar patterns. With all the reservations appropriate to 
this way of examining what has happened, we find that the proportion scor¬ 
ing ahove 700 on English Composition and Math 11 mirrored the contrast 
we showed for Verbal and Math scores on the SAT: a sharp drop in the 
English Composition in che 1970s, with no recovery in the 1980s; an 
equally sharp and steep rise in the Math 11 scores beginning m the 1980s 
and conrinuing through the 1992 test. The resulrs for American 1 listory 
and Biology were much flatter. Method 2 showed no consistent trend up 
or down, and only minor movement in either directum at any time. 
Method 3 showed similar shallow bowl-shaped curves: reductinns durmg 
the 1970s, recovery durmg the 1980s that hrought the American Hisrorv 
results dose to the first year of 1972, and hrought Biology to a new high, 
alrhough one that was only fractionally higher than the 1972 resulrs. This 
is consistent with a hroad theme that the Sciences and math improved more 
in the 1980s than the humanities and social Sciences did. 

37. Diane Ravitchs account, one of the first, is still the best (Ravitch 1983), 
with Firm 1991; Sowell 1992; Ravitch 1985; Boy er 1983; and Porter 1990 
providing perspectives on different pieces of the puzzle and gmdance ro the 
voiuminous lirerature in magazines and Journals regarding rhe educarional 
changes in elementary and secondary schools. For hasic texts hy advocates 
of rhe reiorms, see Coodman 1962; Kohl 1967; Silherman 1970; Kozol 


1967; Featherstone 1971; Illich 1970; and the one that in some respects 
started it all, Neiü 1960. 

38. Fiske 1984; Gionfriddo 1985. 

39. Sowell 1992, P . 7. 

40. Bishop 1993h. 

41. Bejarand Blew 1981; Breland 1976; Etzioni 1975; Walsh 1979. 

42. By the early 1980s, when the worst of the educarional crisis had already 
passed, the High School and Beyond survey found that students averaged 
only three and a half hours per week on homework (Bishop 1993h). 

43. DES 1992h, Table 132. 

44- DES 1992b, Table 129, The picture is not unambiguous, however. Mea- 
sured in “Carnegie units,” representing one credit for the completion of a 
one-hour, one-yearcourse, high school graduates were still gettingasmaller 
proporrion of their education from academic units than from vocational 
or “personal“ units (National Center for Education Statistics 1992, p. 69). 

45. We do not exempt Colleges altogether, but the re are far more exceptions 
ro rhe corruption as we mean it at the university level than at the high 
school level, in large part because high schools are so much more shaped 
hy a few scandardized textbooks. 

46. Gionfriddo 1985. 

47. Irwin 1992, Table 1. The programs we designated as for the disadvantaged 
were the Title 1 hasic and concentration grants, Even Start, the programs 
for niigratorv children, handicapped children, neglected and delinquent 
children, the rural technical assistance Centers, the state block grants, in- 
expensive hook distribution, the Eilender fellowships, emergency Immi¬ 
grant education, the Title V (drug and alcohol abuse) state grants, national 
programs, and emergency grants, Title VI (dropout), and bilingual program 
grants. 

48. DES 1992h, Tahle 347. 

49. Calvin Lockridge, quoted in “Old dehare haunts Banneker’s future,“ Wash¬ 
ington Pust, March 29, 1993, p. A10, 

50. Ihid. 

51. Bishop 1993h. 

52. For a coherent and attractive list of such reforms, see Bishop 1990h. 

53. Stevenson er al. 1990. 

54- E.g., 63 percent of respondents in a recent poll conducted hy Mellman- 
Lazarus-Lake for the American Association of School Administrators 
thoughi that the nation s schools needed “major reform,“ compared to only 
33 percent who thought their neighborhood schools needed major reform. 
Roper Organization 1993. 

55. E.g., Powe 11, Farrar, and Cohen 1985. 
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56. Bishop has developed these arguments in several studies: Bishop 1988h, 
1990a, 1990b, 1993a, 1993b. 

57. Bishop 1993b (p. 20) cites che example of Nationwide Insurance, which 
in the single year of 1982 sent out over 1,200 requests for high schon! trän- 
scripts and got 93 responses. 

58. Bishop 1988a, 1988b, 1990a, 1993a, 1993b. 

59. Bishop 1990b. 

60. Ibid. 

61. The Wonderlic Personnel Test tits this description. For a description, see 
E. F. Wonderlic & Associates 1983. The value of a high school transcript 
applies mainly to recent high school graduates who have never held a job, 
so that employers can get a sense of whether this person is likely to come 
to work every day, on time. But after the first job, it is the job reference 
that will count, not what the Student did in high school. 

62. The purposes of such a program are primarily to pur the federal govern- 
ment f our-square on the side of academic excellence. lt would not appre- 
ciably increase the number of high-scoring studenrs going to College. 
Almost all ot thern already go. But one positive side effect would be to ea.se 
the financial bürden on many middle-class and lower-middle-class parents 
who are too rieh to qualify for most scholarships and roo poor to send their 
ehildren to private Colleges. 

Chapter 19 

1. Quotas as such were ruled illegal by the Supreme Court in the famous Bakkc 
case. 

2. Except as otherwise noted, our account is taken from Maguire, 1992. 

3. A. Pierce et ab, “Degrees of success," Washington Post,May 8, 1991, p. All. 

4. Seven COFHE schools provided data on applicants and admitied studenrs, 
hut not on matriculated studenrs. Those schools were Barnard, Bryn Mawr, 
Carleron, Mount Holyoke, Pomona, and Smith. The ethnic differences in 
scores of admitied studenrs for these schools were in the same ränge as the 
differences for the schools shown in the figure on page 452. Yale did not 
supply any data hy erhnicity. Data are taken from Consortiinn on Financ- 
ing Higher Education 1992, Appendix D. 

5. “Best Colleges,” US. News & World Report, Oa. 4, 1993, pp. 107.27. 

6. Data for the University of Virginia and University ot California at Berke¬ 
ley are for 1988 and were obrained from Sarieh 1990 and 1.. Feinherg, 
“Black freshman enrollmenr rises 46% at U-Va," Washington Post, De- 
cemher 26, 1988, p. CI. 

7. The figures for Standard deviations and percentiles are hased on the 


COFHE schools, omitting Virginia and Berkeley. The COFHE Redbook 
provides the SAT scores for the mean, 25th percentile, and 75th percentile 
hy school. We computed the estimated Standard deviation for the com- 
bined SATs as follows: 

Estimated Standard deviation for each test (Verbal and Math): given the 
scores for the mean and any percentile, the corresponding SD Ls given by 
(x-'tn)/z, where x is rhe score for the percentile, m is the mean, and z is 
the standardized score for that percentile in a normal distribution. Two 
separate estimates were computed for each school, based on the 25th and 
75th percentiles. These two estimates were averaged to reach the esti¬ 
mated Standard deviation for each test. 

The formula for estimating the Standard deviation of combined tests is 
\jö\ + o*; + 2 r Oj 0 ? » wbere r is the correlation between the two tests 
and er represents the Standard deviation of the two tests. The correlation 
of the verbal and math SATs as adtninistered to the entire SAT popula- 
rion is .67 (DonIon 1984, p. 55). The correlation for elite schools ismuch 
smaller. For purposes of this exercise, we err on the conservative side by 
conrinuing to use the correlation of .67. We furrher err on the safe side 
by using the Standard deviation for the entire Student population, which 
is inflated hy the very affirmative action admissions that we are analyz- 
ing. lf instead we were to use the more appropriate haseline measure, the 
Standard deviation for the white students, the Harvard Standard devia¬ 
tion (known from unpublished data provided hy the Admissions Office) 
would be 105 instead of 122. For both reasons, the analysis of the gap be¬ 
tween minority and white students in the COFHE data is understated. 
To give an idea of the magnitude, our procedure underestimated the 
known black-white gap at Harvard by 14 percent. 

8. The Berkeley figure for Latinos is an unweighred average of Chicanos and 
other Latino means. 

9. Scholars who have tried to do work in this area have had a tough time ob- 
taining data, up ro and including researchers from the Office for Civil 
Rights in the Department of Education (Chun and Zalokar 1992, note, p. 
108). 

10. The Berkeley figure for Latinos is an unweighted average of Chicanos and 
other Latino means. For Davis, only a Chicano category is broken out. Vir¬ 
ginia had no figure for Latino students. 

11. Chun and Zalokar 1992. 

12. Committee on Minority Affairs 1984, p. 2. 

13. Chan and Wang 1991; Hsia 1988; Li 1988; Takagi 1990; Bunzel and Au 
1987. 
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14. K. Gewertz, “Acceptance rate increases ro 76% for dass of 1996,” Harvard 
University Gazette May 15, 1992, p. 1. 

15. F. Butterfield, “Colleges luring black students with incenrivcs,” New York 
Times , Feb. 28, 1993, p. 1 

16. For Chicano and other Latino students at Berkeley, ehe comparutive Posi¬ 
tion with whites also got worse. SAT scores did not rise significantly for 
Latino students during the 1978-1988 period, and rhe net gap increased 
from 165 to 254 points for the Chicanos and from 117 poims to 214 points 
for other Latinos. 

17. Powers 1977, as reported with supplementary analysis in Klitgaard 1985, 
Table A1.6, p. 205. 

18. The 12-15 ränge cutsoff the upper 11.5 percent, 14.9 percent, and 7.5 per- 
Cent of matriculants with known MCAT scores for the biological Sciences, 
physical Sciences, and verbal reasoning tests respectively. By way of com- 
parison, the top 10 percent in the SAT-Math in 1993 was a lirtle above 
650; in the SAT-Verbal, in the high 500s. 

19. Shea and Fullilove 1985, Table 4, reporting 1979 and 1983 data, indicate 
that blacks with MCAT scores in the 5—7 ränge had approximately rvvice 
the chance of admission of white students. ln another glimpse, a multi- 
variate analysis of applicants to medical school from among rhe under- 
graduates at two University of California campuses (Berkeley and Davis) 
during the last half of the 1970s began with the average white male appli- 
cant, who had a 17.8 percent chance ot being admitred. Holding other 
characteristics constant, being black raised the prohahility of admission to 
94.6 percent. Being an American Indian or Chicano raised rhe probahil- 
ity to 95.0 percent (Olmstead and Sheffrin, 1980a). An Asian with lden- 
tical age and academic credentials had a 25 percent chance of admission, 
higher than the white prohahility but not Statistical ly significantly so. 
Williams, Cooper, and Lee 1979 present the odds from rhe opposite per¬ 
spective. A study of ten medical schools hy the Rand Corporation tound 
that a mmority Student with a 50 percent chance of admission woukl have 
had about a 5 percent chance of admission if he were white with the sanic 
qualifications. 

20. Klitgaard 1985. 

21. Proponents of affirmative action commonly eite preference for children of 
the alumni and students from distant States as a justificarion for affirma¬ 
tive action. Civen the size of the racial discrepancies we have reported, it 
would be useful to have an open comparison of the discrepancies associ- 
ated with these other forms of preference. We have found data from only 
one school, Harvard, where the legacy of having a Harvard parent coiv 
tinues to be a plus in the admissions process hut small in terms of test scores. 
For the decade starting in 1983, the average Verbal score of alumni chil¬ 


dren admirted to Harvard was 674 comparecl to 687 eamed by the ad mit- 
red children of nonalunini; for Math scores, the comparable scores were 
695 versus 718, respectively. Office of Civil Rights 1990. 

22. Higham 1984. The arguments againsr admitting Jews were likely to men- 
tioti that gentile families might not send their children to a College with 
“too many” Jews (institutional self-interest) or that antl-Semirism wt)uld 
make it hard for Jewish alumni to use their College educarion for society’s 
welfare (social utility). 

23. Berger 1987. 

24. Lloyd 1990; Peiler 1991. 

25. The formal explication of this Standard is Thorndike 1971. For a discus- 
sum ot how slippery the notion ot “acceptahle" pertormance can he, see 
Brown 1980. 

26. The comparisons are hased on NLSY suhjects who wem to the same four- 
year Colleges and universities (again, excluding historically black schools). 
Excluding junior Colleges eliminates problems of interpretation it different 
proporrions of different ethnic groups arrended junior Colleges rather than 
four-year insrirutions. Since the framework for the analysis assumes a mul- 
riracial eampus, it seerned appropriate to exclude the 103 NLSY suhjects 
(all but 6 of whom were black) who arrended historically black institu- 
tions. For the record, the mean AFQT score of black students who first at- 
tended historically black institutions and blacks who first attended other 
four-year institutions were with in two IQ points of each other. 

27. We used the top and bottom half of socioeeonomic Status rather than a 
more restrictive definirion (such as the top and bottom quartile) to give 
large enough sample sizes for us ro have contidence in the results. When 
we used rhe more restrictive definirions, the results showed admissions de- 
cisions that were even farther out of line with the rationale, hut with small 
samples numbering just 15 pairs for two of the cells. The procedure for rhe 
analysis was as follows: The NLSY Lncludes the Fl CE (Federal Interagency 
Committee on Edueation) codetor each Institution the NLSY suhjects at- 
rended. This analysis is based on the first such institurion attended atter 
high school. The matching procedure somerimes creares multiple lines for 
one member of rhe pair. For example, suppose thar three whites and one 
black have attended the same school. One may either enter the black score 
three times or eliminate duplicates, entering the black score only once. We 
consider that the elimination of duplicates is likely to introduce more er¬ 
ror, on the assumption that the differences among Colleges can be large. 
Imagine a sample consisting of two schools: an unassuming state teachers’ 
College, with three whites and three blacks in the NLSY sample, and Yale, 
with three whites and one black. The Yale scores are much higher than the 
teachers College scores. Eliminaring duplicates—entering just one (high) 
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black score für Yale instead of the same score three times—would defeat 
the purpose of matching schools. The figures reported in the texr are thus 
based on means that have counted some people more than once huf con- 
trol for institutional effects. The mean used to compute a cell entry is the 
intercept of a regression in which the dependent variable is IQ score and 
the independent variables are the institutions, coded as a vector of nomi- 
nai variables. Note that we also reproduced this analysis eliminating dm 
plicates. The results are so similar that the alternative numbers could be 
inserted in the text without requiring the change of any of rhe surround- 
ing discussion. 

ln addition to this form of the analysis, we examined other vvays of ent¬ 
ring off low and high socioeconomic Status, ranging from the most general, 
which divided the Jeciles into the top and bottom five, to the most ex¬ 
treme, which considered only the top and bottom deciles. For the lauer 
analyses, we used the entire sample of NLSY students who attended four- 
year institutions, to preserve large enough sample sizes to analyze. Those 
results were consistent with the ones presented in the text. A positive 
weight attached to being black until reaching the most extreme compari- 
son, of a white Student in rhe bottom socioeconomic Status decile com- 
pared to a black Student in the top decile, at which point the edge for the 
black Student feil to dose to zero (but never actually reached zero). We für- 
ther examined the results when the sample consisted of NLSY subjeets who 
had received a bachelors degree (not just attended a four-year College). 
The pattem was identical for both blacks and Latinos, and even the mag- 
nitudes of the ditferences were similar except that, as in other replicarions, 
the gap hetween the disadvanraged white and disadvanraged black grew 
substantially over the one reported in the text. 

2h. The computation, using IQ scores, was (black mean - white mean)/(SD 
of all whites who attended a four-year institution as their first College), ln 
understanding the way that affirmative action operates, we take it that rhe 
reference point is the white Student population, which indeed squares with 
most qualitative discussions of rhe issue, pro and con. 

29. Perhaps “low SES” for blacks meant a much worse backgrounj than “low 
SES” for whites? Not by much; the means for both groups were dose ( 5 Ist 
percentile for whites, 25th for blacks), and Controlling for the difference 
did not appreciably change the story. Nor did ir do any good to try to de¬ 
hne high and low SES more strictly, such as people in the top and bor- 
tom quartiles. In that case, the disadvantaged blacks were admirted with 
even lower lower scores than disadvanraged whites, in the region of 1.5 
Standard deviarions (depending on the specific form of the analysis)— and 
soon through thecells in the table. 


30. We use this indirect measure because other more direct measutes (e.g., the 
numher of blacks enrulling in College out of high school, or the number of 
persons ages 20 to 21 enrolled in school) do not go back to rhe 1960s and 
1950s. 

From 1950-1969, data are available only for “blacks and others.” Over- 
lapping data indicate that the figure for “blacks only” in the early 1970s 
was stähle at approximately 95 percenr of rhe “blacks and other” figure. 
The data for 1950-69 represent the “blacks and other” numbers multiplied 
hy .95. If one assumes that the pruporrion was somevvhat higher in the 
1950s and early 1960s, this produces a fractional overestimate of the up- 
ward black trendiine, but so small as to he visually imperceptibie in the 
graph on page 469. 

31. Carter 1991; D’Souza 1991; Sowell 1989; Sowell 1992; Steele 1991. 

32. See, lorexample, Sarich 1990; Lynch 1991. 

35. For a review of this literature through the 1970s, sec Breland 1979. Re¬ 
search since then has not ehanged the picture. See also Linn 1983; Don- 
Ion 1984, pp. 155-159. 

34. As in so many matters involvmg affirmative action, this indirect reason- 
ing would be unnecessary if Colleges and universities were to open their 
data on grades to researchers. 

35. Althach and Lomotey 1991; Bunzel 1992; D’Souza 1991. 

36. E.g., Carter 1991; Steele 1991. 

37. National Center for Education Sraristics 1992, Tables 170, 249. In the 
NLSY sample, among all students who first entered a four-year nonblack 
uni Vers ity, 27 percent of the whites failed to get a bachelors degree 
compared to 57 percent of the blacks and 55 percenr of Latinos. “Dropout” 
in the NLSY is defined as having taited to have completed a hachelor’s 
degree hy the 1990 interview, despite having once entered a four-year 
College. Ry that time, the youngest memhers of the NLSY were 25 years 
old. 

38. The real discrepancy in dropout rates involved Latinos. Using the same 
analysis, the probability that a Latino Student with an IQ of 110 would get 
a hachelor’s degree was only 49 percent. These results are produced when 
rhe analysis is run separately for each race. 

39. A. Hu, “Huls on first,” Asian Weck, May 1 2, 1989, p. 7; Consortium on Fi- 
nancing Higher Education 1992. 

40. A. Hu, “Minorities need more Support,” The Tech , Mar. 17, 1987, p. 1. 

41. Carter 1991; Sowell 1992; Steele 1991; D’Souza 1991; Murray 1984. 

42. There should probably also he some contraints on the spread of the abil- 
ity distrihutions in various groups, but such specificity would he out of place 
here. 
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Chäpter 20 


1. This Statement assumes thar the violarion of the 80 percent mit* is statis- 
rically significant. With sufticiontly small numhers of hirees or promotions, 
these percentages will fluctuate widely hy ehance. 

2. The Uniform üuidelinesare just guidelines, not laws. Iiume notable 1982 
case (Connecticut v. Teal), the Supreme Court ruled that even rhe praetice 
of meeting the 80 percent rule hy hiring larger numhers of fest passers from 
the protected than from the unprotected groups still falls short if the rest 
producesdisparate impact. Disparate impact, in and of itself, said the Court 
in Teal, deprives proreered applicanrs of equal oppormnity, even if the Jis- 
proporrionate numhers are corrected at rhe hottom line. Under this ruling, 
an empLoyer who hires a given number ol hlacks will he violaring rhe law 
if the blacks have high ability test scores, but not violaring the law if rhe 
same number of blacks are hired without recourse lo the scores at all, and 
thtts are hound to have lower scores on average. This eventualiry was 
lauded hy Kelman 1991, who argues (p. 1169) that hiring a larger Propor¬ 
tion of test-passing blacks than resr-failing blacks "stigmatizes" blacks be- 
cause it iinplicitly validates a fest on which blacks on average score below 
whites. Retter, he suggests, not to test at all, tacitly assuming that rhe test 
has no predictive power worth considering. For anorher view o (Teal see 
Epstein 1992. 


3. The Hartigan Report is discussed in Chaprer 3. 

4. E.g., Kelman 1991. 


5. Heckman and Payner 1989, p. 138. 

6. The categories are based on those defined hy the federal government. The 
professional-technical caregory was chosen to represent high-status jobs. 
The Jerical caregory was chosen borh to represent lower-status skilled jobs 
and also because, among those categuries (others are sales workers and rhe 
crafr workers), clerical is the only caregory thar shows a visihly steeper in- 
crease afrer 1959 than betöre it. Two technical points about the graph on 
page 485 are important. First, the joh Classification System used hy the C len 
sus Bureau was altered in 1983. Figures for 1983-1990 conform to the Clas¬ 
sification sysrem in use from 1959-1982. 'The professional-technical 


caregory for 1983-1990 consists of the sum of the headings of “Professional 
specialry," “technical, sales, and administrative support,” “accountants and 
auditors,” and “personnel, rraining, and labor relations specialists.” The 
clerical caregory consists of the sum of “administrative support, including 
clerical, and cashiers. Second, the data in rhe graph are for blacks only, 
corrected for the “blacks and others" enumerarion that was used unril 1973. 
The correction is based on the knmvn ratio of jobs held hy the “others” in 
“hlacks and others” for overlapping data as of 1973. This assumes thar rhe 


“others” (mostly Asian) held a eonstant proportion of clerical and Profes¬ 
sional jobs held hy “blacks and others” from 1959-197 5. If in fact the pro¬ 
portion went down (hlacks acquired these jobs dispmporrionately), then 
the pre-197 5 line in the graph slightly underestimates the slope of the black 
increase. If in fact the proportion went up (the “others” acquired these jobs 
disproporrionately), then the pre-1973 line in the graph slightly overesti- 
mates rhe slope ot the black increase. Note, however, that even asof 1975, 
blacks constituted 87.9 percent of the “black and other” population ages 
18 and over, compared to 91.9 percent in I960, so rhe degree ot error is 
unlikely lo he visually perceprible in the graph. The alternative was roshovv 
“hlacks and others” consistently from 1 959 into rhe 1990s, but from a tech¬ 
nical perspective this becomes inereasingLy inaccurare as rhe percenrage 
of “others” increases rapidly in the 1970s and 1980. Visually, graphs pre- 
pared under ei dt er method show the same story. 

7. The main complications are, first, that rhe affirmative action policies 
evolved over a period of time, so that the landmark events are not as de- 
cisive as they may appear ro be (see Appendix 7). Second, laws and regu- 
lations often institutionalize changes that were already under way for other 
reasons. This seems to he clearly the case with the hiring of minoriries, and 
ir, too, tends to blunt the impact of rhe laws and regulations when rhey 
comc along. Third, different regions of the country prohahly reacted to the 
laws and regulations difterenrly, therehy diluting their impact in national 
star ist ics. 

8. Donohue and I leck man 1991; Epstein 1990; Freeman 1984; Heckman and 
Payner 1989; Heckman and Verkorke 1990; Leonard 1986; Welch 1981. 

9. I5mwn and Erie, 1981 concluded that abour 55 percent of the increase in 
black managerial, Professional, and technical employment from 1960 to 
1976 oceurred in the public seetor. 

10. The classic exchange on this topic is Epstein 1992, Chap. 12; Heckman 
and Payner 1989. 

11. The normative 1 Standard deviation difference is assumed for rhis exercise. 
The ohserved difference in the N LS Y is larger, he nee would only exacer- 
hate the conclusion suggested by the graphic on page 485. 

12. Übviously, there will be employees who fall outside rhe ränge. Rur insofar 
as the tails at both ends are small and roughly equivalent, rhe calculation 
is not inuch affected. These particular numhers are based on the ohserved 
distribution of NLSY whites in these job categories. For clerical jobs, 90 
percent of all white employees had lQs between 85.7 and 122.7, with a 
Standard deviation of 11.5. For Professional and technical jobs, 90 percent 
of all white employees had IQs of 98.0 and above, with a Standard devia¬ 
tion of 11.8. 
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13. The assumptions used for the figure are extremely conservarive. Mosr oh- 
viously, the Standard deviation of 15 is roo high. People within an occu- 
pational category will always tend to have a smaller dispersion than the 
general population. Ifwe change nothing excepr reduce the Standard de- 
viations to 12 for both blacks and whites, in line with the obsctvcd stan- 
dard deviations in the NLSY, the black-white ratios rise from 1.7 
(professionai'technical) and 1.6 (clerical) to 2.5 and 1.9 respecrively. In 
addition, however, the graph on page 490 is conservative in using an IQ 
ränge that encompasses 90 percent of the white workers in an occuparional 
category. The lower the bottom end of the ränge is, the more it dispropor- 
tionately inflatcs the eligible portion of the black population (changes in 
the top end of the ränge are at the tail of the distrihurion and add very lir- 
tle to the eligible pool). Visualize the bell curve: By lowering the bottom 
cutoff for professionai'technical professions from 100 to 98 (for cxample), 
everyone in that very tat part of the curve is treated as being just as eligi- 
ble for a professionai'technical occupation as anyone eise—even thotigh, 
in reality, thev are much less likely than persons with higher IQs to get 
such jobs. If, for example, we base the ränge on the IQs that emhrace 80 
percent of the white workers in an occupation—more realisric in rnany re- 
specrs—the black-white ratio in 1990 grows to 2.3 for Professional- 
technical occupations and 1.8 for clerical. But the conciusions still hold 
even it we broaden the ränge still further than in the graph, to emhrace 95 
percent of all people in those occupations. ln that case—which assumes, 
implausibly, that all people with IQs higher than 89.8 are equally likely to 
be hired for technical-professional jobs and that all people with IQs he- 
tween 82.0 and 1 30.3 are equally likely to he hired for clerical jobs—the 
black-white ratio as of 1990 is still greater than 1 in hoth instantes: 1.2 for 
professionai'technical, 1.5 for clerical. ln short, the differentes produced 
by altering the assumptions can make substantial differences in the size of 
the estitnates of disproportionate hiring, but even assumptions that go well 
beyond common sense and the available data do not clvange the overall 
conciusions drawn in the text. 

14. The observarions using the CPS and the NLSY are not completely inde¬ 
pendent, insofar as we took out estimate of the IQ ränge for clerical and 
Professional-technical occupations from the data on NLSY whites. But 
those parameters did not constrain the results for blacks. 

1 5. The sample in these analyses exduded persons who were still in scliool in 
1990. 

16. Jaynes and Williams 1989, Tables 4-1,6-1. 

1 7. Hartigan and Wigdor 1989. See also Chapters 3 and 13. 

18. As of 1987, States had such a certification process. See Rudner 1988. 

19. Straus and Sawyer 1986. 


20. Lerner 1991. 

21. In Pennsylvania, with the highest pass rates, the state commissioner of 
higher eduearion openly acknowledged that Pennsylvania sought to avoid 
lawsuits alleging racial bias in the test by establishing a low cutoff score 
that they would subsequently try to raise. See H. Collins, “Minority groups 
are still lagging on teacher’s exam,” Philadelphia Inquirer , Aug, 5, 1989, p. 
Bl. 

22. The answer to the question of how such large differences can show up in 
otherwise credentialed teachers is, in effect, the topic of the preceding 
chapter, on affirmative action in higher education. 

2 h It we make the empirically more likely assumption that IQ does have a pos- 
itive correlation with the non inte Uectuai skills, then the people with low 
inrellectual skills will, on average, also have depressed nonintellecrual job 
skills. 

24. Por examplcs of affirmative action programs in public bureaucracies, sec 
Lynch 1991, pp. 24~32; Taylor 1992, Ühaps. 4, 5. 

25. Carlson 1993. 

26. Carlson 199.3, p. 28. 

27. Carlson 1993, p. 30. 

28. \X4ishmgton Post, October 24-28, 1993. 

29. lOelatrre 1989; Sechrest and Bums 1992. 

30. Among the orher stories we have located linking poor worker performance 
to hiring under affirmative action requirements are one reporring an in-- 
crease in collisions and orher accidents on the New York public trans- 
portation System (K. Foran, “TA lax on Safety,” Netesclay, Sept. 19, 1990, 
p. 5), another descrihing the rise in criminal hehavior among Detroit’s po- 
lice officers (E. Salholz, “Going After Detroits rogue cops,” Newsweek, 
Sept. 5, 1988, p. 37), and one discussing the much higher rate of firings 
among Bostons black postal workers, compared to white workers (B. 
McAllister, "Researchers say Postal Service tried to block article on fir¬ 
ings," Washington Post, Oct. 17, 1992, p. A3). 

31. Silberberg 1985. See also Ford et al. 1986; Kraiger and Ford 1985. 

32. Silberberg has his own interesting hypotheses ahout these differences, 
which we do not elaborate here. Nothing in his account is at variance wirb 
nur conclusion that affirmative action procedures are exacting a cost in 
worker performance. 

33. Hacker 1992, p. 25. 

34- ln fact, that was precisely the excuse often given by the nvajor leagues for 
not hiring blacks. 

35. For a detailed Statement of this perspective, see Kelman 1991. 

36. Quoted in Bo lick 1988, p. 49. See also Taylor 1992, p. 126. 

37. There is a presumption that if we cannot explain a group difference, it is 
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appropriate to assume that there is no good reason for ir. This is had logic. 
Not knovving a good reason for a difference is nor the same as knowing rhar 
there is no good reason. 

38. We understand the argument that, in the long term, and taking the hroad- 
est possible view, if all businesses were to behave in “socially responsihle” 
ways, there would resuit a hetter soeiety that would provide a healthy di' 
mate for the businesses themselves. Our argument issomewhat more direct: 
Can a university President, thinking realistieally about the foreseeahle fii' 
tu re, see that his university will he hetter qua university by admitting so me 
students who are academically less qualified than their competitors? Gciv 
erally, yes. Can the ownerof a business, thinking realisrically ahout the fore- 
seeahle future, see that his business will be hetter qua business by hi ring 
people who are less productive than their competitors? General ly, no. 

39. D. Pitt, “Despite revisions, few blacks passed police sergeant Lest,” New 
York Times, January 1 3, 1989, p. 1. 

40. See Taylor 1992, pp. 129-1 37, for an accounr of some of rhe more egre- 
gious examples. 

41. The largest difference, 1.6 SDs, was for persons with advanced degrees. For 
Latinos, the gap with whites ranged from .6 to 1.0 SDs. 

42. Other approaches for contending with affirmative action constraints have 
surfaced. For example, New Yorks Sanitarion Department used a test on 
whieh 23,078 applicants our of 24,000 got perfect scores, and irs Firc De¬ 
partment used a test with multiple choice questions for whtch a poinr of 
credit was gtven if the first choice is correct, a halt'-poim if rhe second 
choice is correct, or a quarter-point if the rhird choice is correct, therehy 
inflating the grades for people who get lots of items wrong (Taylor 1992). 

43. Hartigan and Wigdor 1989; Hunter and Hunter 1984- 

44. For an accounr, see Hartigan and Wigdor 1989. 

45. E. F. Wonderlic & Associates, 1983, Tahle 18, P . 25. The scores of Asmns 
are lower rhan the national mean (in conrrast to resultsoflQstudies) prolv 
ahly hecause the Wonderlic, a pencil-and-paper test, is language sensitive 
and is widely used for lower-level johs. It seems likely rhar substantial pro- 
porrions of Asians who take the Wonderlic are recent immigranrs for whom 
English is a second and offen newly acquired language. 

46. Summarized in Lynch 1991. See also Detlefsen 1991. 

('hapter 21 

L Kaus 1992. Kauss analysis runs parallel with our own in many respects_ 

among orher things, in his use of the Herrnstein syllogism (Herrnstein 
1971, 1973) to think ahout the stratifymg influence of inrelligence. 


2, The remark appeared in rhe manuscript of T/ic End of Equality. Ir is used 
here with permission of the author. 

3. Quoted inNovak 1992, p. 24- 

4- Surveys by the Roper Organization (Roper Reports 92-5), as reported in 
American Enterprise (May-June 1993): 86. 

5. U,S. Bureau of the Census 1992, Tahle B-6, 1975. 

6. U.S. Bureau of the Census, 1991, Tahle IV.3. All data are hased on pretax 
income, so the tax reforms of the 1980s arc not implicared. 

7. Reich 1991. 

8. Voring estimated from Jennings 1991, Tables 7, 10, 13. 

9. Overall, 19.2 percenr of children born to NLSY women from rhe 
mid- 1970s through 1990 were horn to unmarried mothers with belovv- 
averago IQs. The national illegitimaey ratio grew sleadily throughout that 
period. 

10. “White” includes hirths ro Caucasian Latinos. The National Center for 
1 iealth Statistics has provided Latino/non-Latino breakdowns only since 
1986. Düring rhar period, rhe non-Latino white illegitimaey ratio in- 
creased from 13.2 percent to 18.0 percent in 1991, the latest figures as we 
write. 

1 l. Data refer ro poverty in the year prior to birth, and to non-Latino and 
Latino whites comhined, to be consistenr with the use of “white“ in rhis 
discussion. The proportions fr>r non-Latino white women above and he- 
low rhe poverty line were quite similar, however: 6 percent and 44 percenr 
respecrively. 

12. Unpuhlished derailed tables for Bachu 1993, available from rhe Bureau of 
rhe Census. 

1 3. These continue to he figures for Latino and non-Latino whites comhined. 
The figures for non-Latino whites may be found in Chapter 8. They are 
not so different (hecause non-Latino whites st) dominate the total). 
Seventy-two percent of illegitimate children of non-Latino white mothers 
in the NLSY had IQs below 100, and 39 percenr had IQs below 90. 

14. Wilson 1987. For a comp le men tary view, see Massey and Den ton 1993. 

15. ln the NLSY, blacks from the lowest quarrile of socioeconomic hackgmund 
had a mean IQ equivalent of 82. 

16. For an early sraremenr of this argument, see Murray 1988a. 

17. Jencks and Peterson 1991. 

18. Chapter 16 discussed some of these efforts with regard to intelligence. For 
hroader-ranging assessments, sec Murray 1984; Stromsdorfer 1987; Rossi 
1987; Glazer 1988. 
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Chapter 22 

1. The phrasing draws from Rawls 1971, pp. 14-15. 

2. For discussion of this transformation, see, for example, Brown 1988. 

3. Thomas Hobbes posrulated an axiom—Hobbes saw ir as literally an ax- 
iom, in che mathematical sensc—for governing people wirb equai rights 
to liherty: “That a man be willing, when others are so roo . . . to lay down 
this right to all things; and be contented with so much liherty against other 
men, as he would allow other men against himself.” Hobbes 1651, Chap. 

14- 

4- Hobbes expressed the gloomy prospect of perfect anarchy in the one sen- 
tence for which he is best rememhered: “And the life of man [would bei 
solitary, poore, nasty, brutish and short.” Hobbes 1651, Chap. i 3. 

5. Locke 1689, Second Treatise, sec. 4. 

6. Locke 1689 Bk. IV, Chap. XX. 

7. See, for example, Wi 11s 1978; Beer 1993. 

8. Mayo 1942, pp. 77-78. 

9. Costopoulos 1990, p. 50. 

10. Costopoulos 1990, p. 47. 

11. Mayo 1942, P . 78. 

12. Costopoulos 1990, p. 47- 

13. Quoted in Diamond 1976, p. 16. 

14. Costopoulos 1990, p. 48. 

15. That iact, combined with the “irresistihle eorruption” that Adams saw as 
infecting all political Systems, caused him to be deeply pessimisric about 
the survival of the experiment in human government that he had heen so 
instrumental in founding. He soinetimes wondered gloomily wherher a 
hereditary aristocracy on the British model mighr he necessary to offset the 
unrestrained avarice and factiousness of Jefferson s natural aristocracy. 

16. Aristotle 1905 ed., p. 207. 

17. Hamilton er al. 1787, No. 10. 

18. White 1958, p. 122. 

19. Huber 1988; Olson 1991. 

20. Bureau of Lahor Sratistics 1982, Table C-23, 1989, Table 42. 

21. In 1990 dollars in all cases: the annual income of male year-round, full- 
time non farm, non-mine laborers was $16,843 in 1958. (SALLS 1970, Table 
347). The eomparable earnings for“handlers, equipment cleaners, helpers, 
and laborers” in 1991 was $16,777- U.S. Bureau of the Census, 1992, Table 
32. The full-time weekly earnings of'“lower-skilled labor" in 1920 was $ 169 
in 1990 dol lars, or $8,459 for a fifty-week year (U.S. Bureau of the Census 
1975, Series D 765-778). 


22. For a full presentation of the following argument, see Murray 1988b, Chap. 

12 . 

21. Wilson 1993. 

24- It is douhtless harder even for hright people to lead law-abiding lives when 
the laws becorne more complex, but the marginal effects will he smalleron 
them than on the less hright. 

25. Ellwood 1988. 

26. For an accessible discussion of the pros and cons of the E1TC, see Kosters 
1993. A more ambitious approach that we think deserves consideration 
would re place the entire structure of federal rr ansfers to individuals—in- 
come Supplements, welfare, in-kind henefits, farm suhsidies, and even so¬ 
cial security—with a negative income tax of the kind proposed by Milton 
Friedman in Friedman 1962. Like Friedman, we are attracted to this strat- 
egy only if it replaces everything eise, a possibility so unlikely that it is hard 
ro talk ahout seriously. This does not Jiminish its potential merit. 

Appendix i 

1. The figure depicts 250 18-year-old males dravvn randomly from the NLSY 
sample. 

2. Based on the NLSY suhjects, born from 1957 through 1964, as of 1982, 
when rhe youngesr was 18 years old, rhe mean heighr of Contemporary 
Americans is a little over 5 feer 7 inches, with a Standard deviation of about 
4 inches. 

3. Based on the 1983 FTS norm study (Braun and King 1987) and dropour 
rares in the 1980s, we estimate the mean for all 18-year-okls (including 
dropouts) at 125, with an Standard deviation of 105. This would indicate 
that the 99th centile hegins at a score of 569. The example in the text is 
phrased eonservatively. 

4. The Pearson’s r is .501 in both cases. The numher 3,068 refers to males 
with weight and height data in 1982. 

5. For simplicity’s sake, we are assuming that the variables can have only lin¬ 
ear relationships with each other. 

Appendix 2 

1. The NLSY on CD-ROM disk is available for a nominal fee from the Cen¬ 
ter for Human Resource Research, Ohio State University. 

2. Inquiries should be direct cd to Prof. Richard J. Hermstein, Department of 
Psychology, William James Hall, Harvard University, Cambridge, MA 
021 38, or ro Dr. Charles Murray, American Enterprise Institute, 1150 17th 
Sr. NW, Washington, DC 20036. 
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3. Data für 1991 had become available in time to be used for rhe analysis, but 
for budgetary reasons, the NLSY had to cut the supplementary sample of 
luw-income whites as of 1991. We decidcd that the advanrages of indud- 
ing low-income whites in the analysis outweighed the advanrages of an ad¬ 
ditional year of data. 

4- We followed the armed forces 1 convention of limiting suhtest scores to a 
maximum of three Standard deviations from the mean. We gratefully 
acknowledge the assistanee of Dr. Malcolm J. Ree, who led the revision of 
the AFQT, in computing the revised scores for the NLSY. 

5. Th Ls procedure is facilitated by the large sample sizes (at least 1,265 wich 
valid AFQT scores in each hirth year, which are as large as the samples 
commonly used for national norms in tests such as the WISC5 and WAIS), 
and the fact that the NLSY sample was halanced for ethnic group and gen- 
der within birth years. 

6. We also experimented with groupings based not on the calendar year, but 
the schon] year. The differences in centile produced by rhe rwo procedures 
were never as much as two, so we remained with calendar year as the ha- 
sis. 

7. See Users Guide 1993, pp. 157-162. 

Appendix 3 

1. The subtests are General Science (GS), Arithmetic Reasoning (AR), 
Work Knowledge (WK), Paragraph Comprehension (PC), Numerical Op- 
erations (NO), Coding Speed (CS), Auto/Shop Information (AS), Math- 
envatics Knowledge (MK), Mechanical Comprehension (MC), and 
Electronics Information (EL). Two subtests (Numerical Operarums and 
Coding Speed) are highly speeded; the other eight are “power” rather than 
speed tests. 

2. Ree and Earles 1990a, 1990b, 1991c. 

5. We tise rhe term facun in a generic sense. Within psychometrics, terms 
like f(]ctüT and component are used selectively, depending on the particular 
method of analysis used to extract the measures. 

4. E.g., Gould 1981. 

5. Jensen 1987a, 1987b; Ree and Earles 1991c; Welsh, Watson, and Ree 1990. 

6. To account for literally 100 percent of the variance takes ten factors (be- 
cause there are ren subtests), with the final few of rhem making increa; 
ingly negligible contributions. In the case of ASVAR, the final five factors 
collectively account for only 10 percent of the total variance in scores. 

7. Sperl, Ree and Steuck 1990. 

8. Carroll 1988; Jensen 1987a. 


9. Ree and Earles, 1990a, 1990h, 1991c. 

10. Gordon 1984; Jensen and Figueroa 1975. 

11. Note rhat rhe General Science suhtest and the Electronics Information 
suhtest are as highly g-loaded as the subtests used in the AFQT. Why not 
use them as well? Because they draw on knowledge that is specific to cer- 
tain courses that many yourhs might not have taken, whereas the mathe- 
matics and read ing subtests require only material that is ordinarily covered 
in the courses taken by every Student who goes to elernentary and sec¬ 
ondary school. But this is a good illustration of a phenomenon associated 
with IQ tests: People who acquire knowledge about elecrronics and Sci¬ 
ence also tend to have high mathematics and verbal ability. 

12. Jensen 1980, Table 6.10. 

15. Within a single tesr, the fest score might mean any of several percentile 
scores, depending on rhe age of the Student; hence rhe reason for using per- 
centiles. For the analyses in the text, scores were used only if hoth a tost 
score and a percentile were recorded. Anomalous scores were discarded as 
follows: For the California Test of Mental Maturity, one test score of 700. 
For rhe Otis-Lennon Mental Ability Test, eighr cases in which the test 
score was under 30 and the percentile was over 70; one case in which the 
tesr score was 176 and the percentile was only 84- For the Henmon-Nel- 
son Test of'Mental Maturity, one test score ot 574- For the Differential 
Aptitude Test, sixteen test scores over 100. For the Lorge-Thorndike In¬ 
telligence Test and the Kuhlmann-Anderson Intelligence Test, which 
showed uninterpretahle scatter plots of tesr scores against percentiles, cases 
were retained if the test score normed accorcling to a mean of 100 and a 
Standard deviation of 1 5 was within 10 centiles of the reported percentile 
score. The number of eligihle scores on the Stanford-Binet and the Wechs¬ 
ler Intelligence Scale for Children (18 and 16, respectively) was too small 
to analyze. 

14- Jensen 1980, Table 8.5. 

15. This list is taken from Jensen 1980, p. 72. Jensen devotes a chaprer (Chap. 
4) to the distribution of mental ability, which we recommend as an excel- 
lent single source for readers who want to pursue this issue. 

16. For an exploration of the relationships as of the late 1960s, see Jencks et 
al. 1972, Appendix B. For separate studies, see Rutter 1985; Haie, Ray¬ 
mond, and Gajar 1982; Wolfe 1982; Schiff and Lewontin, 1986- 

17. Hus£n and Tuijnman, 1991. See also Ceci 1991, for a case that schooling 
has a greater influence on IQ than has generally heen accepted, drawing 
heavily on data from earlier decades when the natural Variation in school¬ 
ing was large. 
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Appendix 5 

1. Valiclity is measured by the correlation between predictor and ourcome, 
which, multiplied by the ratio of the Standard deviations of the ourcome 
to the predictor, gives the regression coeff icient of the outcomc on the pre- 
Victor. To keep this discussion simple, wo assume an increasing monotonic 
relationship between the validity and the regression coefficient here. For 
a discussion that does not make this simplifying assumption, see Jensen 
1980, 

2. ln the following sources, one can find varying estimates of the magnitude 
oi predictive validity of intelligence tests and varying opinions abour 
wherher the tests are a net henefir ro sociery, hat they unanimously accept 
the conclusion that no bias against hlacks in educational or oceupational 
predictionhas been found: Breland, 1979; Crouse and Trusheim 1988; Har- 
riganand Wigdor 1989; Hunter and Schmidt 1990; Jensen 1980; Klitgaard 
1985; Reynolds and Brown 1984; Schmidt 1988. 

3. For a discussion of the sources of error and their relevance ro meta- 
anal yses of oceupational ourcomes in particular, see Hunter and Schmidt 
1990, For a more general discussion, including educational ourcomes, see 
Jensen 1980. 

4. Jensen 1984b, p. 523. 

5. Occasionally, one may find a study that finds differential predictive valid- 
iry for one ethnic grotip or another for a particular test—e.g. ( the K-ABt 1 
test for Latinos and non-Latino whites (Valencia and Rankin 1988). Bur 
even for Latinos, validity generalization hasgenerally been confirmed (e.g., 
Reynolds and Gurkin 1980; Valdez and Valdez 1983). 

6. Jensen 1980, Table 10.4. 

7. Breland 1979, Table 3h. 

8. Ihid. 

9. Hartigan and Wigdor 1989, Table 9.5. 

10. Ihid. 

11. The example given here is a special case of a more general phenontenon: 
As long as the product of the regression coefficienr (which is assumed not 
to diffei for rhe groups) and the mean difference between groups in the pre- 
dictor is smaller than the mean difference in rhe outcome, there will he 
overprediction for the lower-scoring group. 

12. Fora reviewof the literature rhrough the early 1980s, see Jensen 1985, also 
discussed in Chapter 13. For srudies since then, see Braden 1989; Jensen 
1992, 1993b. The single contrary study extant is Gusrafsson 1992. 

13. McGurk 1951. Also in 1951, Kenneth Eeliss doctoral thesis at the Uni- 
versity of Chicago showed that test item difficulty did notvary nuich across 
white ethnics of different types, rherehy failing to support the intuirion 


that cultural factors are dominant (Hells et al. 1951). See Jensen 1980, 
Chap. 11, for more on McGurk’s and Eeliss werk and on other early srud¬ 
ies of resr item bias. 

14. For a review of the literature through the late 1970s, see Jensen 1980, 
Chap. 11. For studies since 1980, see Barr er al. 1986; Ross-Reynolds and 
Reschly 1983;Sandoval etal. 1983; Jensen and McGurk 1987; Cook 1987; 
Koh, Ahhatiello, and Mcloughlin 1 984; Reschly and Ross-Reynolds 1982; 
Mishra 1983. All found no item differences, or differences that explained 
only a fraction of the differences in group sc o res. Are there any exceptio ns? 
We identified one such study for hlacks (Montie and Fagan 1988), hased 
on 3-year-olds. There may very well be orher studies oi similar size (the 
sample in Montie and Fagan was 86) that are lurking in the literature, but 
we know of no studies using large-scale repräsentative sainples that estab- 
lish item bias against hlacks. Some studies of Latinos have found evidence 
of bias, mostly associated with Spanish and English language characteris- 
tics, See Valencia and Rankin 1988; Whitworth andChrisman 1987, Mun- 
ford and Munoz 1980, But the factor structure of the test resuits has 
generally been found to be the same for Latino and non-Latinos (e.g., see 
Mishra 1981). 

15. See Jensen 1980, Table 11.12. Also see Miele 1979. 

16. Scheuneman 1987. 

17. For a literature review, see Jensen 1980, Chap. 12. 

18. Dyer 1970. 

19. For studies specificaily dealing wirh differential racial effects of coaching 
and practice through the late 1970s, see Baughman and Dahlstrom 1968; 
Cosrello 1970; Dubin, Osburn, and Win ick 1969; Jensen 1980. For stud^ 
ies bearing on rhe issue since 1980 (but not addressing it as directly as the 
earlier ones), see Powers 1987; Terrell andTerrell 1983; Johnson and Wal¬ 
küre 1989; Cole 1987. 

20. For literature reviews, see Sattler and Gwynne 1982; Jensen 1980. 

21. For a literature review, see Jensen 1980, Chap. 12. 

22. For a literature review, see Jensen 1980, Chap. 12. 

23. Jensen 1980, Chap. 12. See also note 14 regarding item bias for Latinos. 

24. Jensen 1980, Chap. 12. 

25. Quay 1971, 1972, 1974. 

26. Farre 11 1983 and the attached responses. 

27. Johnson et al. 1984; Frederiksen 1986; Johnson 1988; Kerr et al. 1986; 
Madhere 1989; Scheuneman 1987; White et al. 1988 

28. Rock et al. 1985 details the changes between the two administrations, con- 
cluding that “the cautious posirion wouId he that neither administration 
had an advantage. A less cautious conclusion is rhar rhe 1980subjects prob- 
ahly had .some small advantage” (p. 18). 
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29. Based on the white Standard deviation for 1980, the first year that Stan¬ 
dard deviarions by race were published. 

30. Congressional Budget Office, 1986, Fig. E-3. 

31. Contrary to populär belief, on the proposition whether brain size is cor- 
related wich IQ, the evidence stmngly favors the pros over the cons, 
even after correcting for stature. A sampling of Contemporary positions 
in thi.s mini-controversy is Cain and Vatulerwolf 1990; Gould 1978, 
1981; Lynn 1989; Michael 1988; Passingham 1982; Rushton I990d, in 
press; Valen 1974. Brain size is, however, not necessarily wholly derermined 
by the genes; it could also he associated with nutririon or general healrh. 

32- The Rushton controversy has unfolded in a rapidly expanding scholaily 
literature. Some of the papers, pro and con, are Cain and Vanderwolf 1990; 
Lynn 1989b; Roberts and Gabor 1990; Rushton 1985, 1987, 1988, 1990a, 
1990h, 1990c, 1990d, 1991a, 1991b; Rushton and Rogaert, 1978, 1988; Sil ’ 
verman 1990; Weitzmann et al. 1990; Zuckerman and Brody 1988. For fur- 
ther substantiation of some of the race differences that Rushton invokes, 
see Ellis and Nyborg 1992; Lynn 1990c; Mangold and Powell-Uriner 1991 ; 
Rowe, Rodgers, and Meseck-Bushey 1988; Valen 1974. 

>3. Almost as all encompassing a thesis as Rushton’s is Richard Lynn’s account 
of the evolution of racial differences in intelligence in terms of the ances- 
tral migrations of groups of early hominids from the relarively benign en- 
vironments of Africa to the harsher and more demanding Eurasian 
latitudes (Lynn 1991c), where they branched into the Caucasoids and 
Mongoloids. Such theories were not uncommon among anthropologists 
and biologists of a generation or two ago (e.g., Darlington 1969). As the 
biological outlook on human hehavior hecame conrroversial, rhis kind of 
theorizing has almost vanished. The modern Version relies much more on 
psychological measurements of Contemporary populations than the earlier 
version. 

Appendix 7 

1- For a coniprehensive discussion, see Epsrein 1992. 

2. Any one oi t hese court eases may involve heroic efforts: “Some courts have 
expressed concern ar rhe specracle of rrials lastingfor weeks, following years 
of discovery, and involving a multitude of Statistical and other experts and 
seemingly endless testimony about rhe credentials of a single [job] candi- 
date. M Bartholet 1982, p. 1002. 

3- Quoted in Patterson 1989, p. 87. 

4. Patterson 1989, 

5. Patterson 1989. 

6. 401 U. 8,424 (1971). 


7. Lynch 1991; Murray 1984; Patterson 1989. 

8. For a clear account, see Patterson 1989. 

9. 401 U.S. 432. 

10. Ibid. 

11. There is good evidence thar the Duke Power Co. had no discriminatory 
infent in using the test or the educational credential; it was using the same 
critena at a time when it was frankly pursuing a race-segregationist hiring 
policy. This earlier conduct gives credence to its claim that it wanted to 
improve its empE^ees’ intellectual level. 

12. Some legal scholars criticize the Court for not having interpreted the Con¬ 
stitution itself, in the Fourteenrh Amendment, as providing protection 
againsr disparare impact (e.g., Trihe 1988). 

13. Ironically, the particular wording in the relevant part of Title VII was an 
accommodation to one of the act s most uneasy opponents, Senator John 
Tower of Texas, who was concemed that the law not be used in precisely 
the manner that, in Griggs, the court. ruled that it should be used (Wilson 
1972). 

14- For an excellent discussion, see Epstein 1992, whose reading of the record 
strongly confirms ours. Epstein makes the point that had the Congress 
known in 1964 whar interpretation the Court was to place on Title VII in 
Griggs, it “would have gone down to thunderingdefeat” (p. 197). From the 
legislative record, that appears to us to be a fair assessment. 

15. Quoted in Wilson 1972, PP . 854ff. 

16. Quotes attributed to S. Rep. 92-415, 92d Gong., Ist sess. 5 (1971), the re- 
port of the Senate Committee on Labor and Public Welfare, in Patterson 
1989. 

17. Wilson 1972. 

18. Bartholet 1982, p. 958. 

19. 422 U.S. 405 (1975). 

20. Our discussion here has drawn on Braun 1992. 

21. Courts other than the Supreme Court have imposed on the employer it¬ 
self the bürden of seeking less discriminatory alternatives (Patterson, 
1989). 

22. For references to the relevant government documents, see Patterson 1989. 

23. For a similar conciusion, and some detail to back ir up, see Porter 1986. 

24- 490 U.S. 642 (1989). 

25. 490 U.S. 659. 

26. Cathcart and Snyderman 1992. 
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From 


“This book is about differences in intellectual capacity 
among people and groups, and wbat those differences 
mean for America's future. The relationships we will 
be discussing are among the most sensitive in Contem¬ 
porary America—so sensitive that hardly anyone writes 
or talks about them in public. It is not for lack of Infor¬ 
mation, as you will see.” 

“To try to come to grips with the nation's problems 
without understanding the role of intelligence is to see 
through a glass darkly indeed, to grope with Symptoms 
instead of causes, to stumble into supposed remedies that 
have no chance of working.” 

“We are not indifferent to the ways in which this book, 
wrongly construed, might do harm. We have worried 
about them from the day we set to work. But there cän 
be no real progress in solving America's social problems 
when they are as misperceived as they are today. What 
good can come of understanding the relationship of 
intelligence to social structure and public policy? Little 
good can come without it.” 
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